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Abstract 

When we try to advance from a solid knowledge of field theory to LHC physics we usually encounter a frustrating 
problem: Higgs physics, QCD at the LHC, and phenomenological strategies appear as a impenetrable block of 
technical know-how, common lores, and historically grown intuition what works and what does not. These lectures 
should put you into a position to digest advanced writeups as well as first research papers on the topic and start your 
first LHC project. 
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User's manual 

These notes are based on lectures at Heidelberg University in the Summer 2009 and in the Winter 2010/2011, written 
up in coffee shops around the world. To those familiar with the German system it will be obvious that the target 
audience of the lecture are students who know field theory and are starting their diploma or master thesis; carefully 
studying these notes should put you into a position to start actual research in LHC physics. 

One thing to keep in mind when reading is that the lectures consist of four hours a week without exercises. This means 
I calculate everything in detail on the blackboard. This is why the notes look the way they look. Because this is not a 
text book there is less text in the notes than actual talk during the lecture. So when reading these notes, take a break 
here and there, get a coffee and think about the physics behind the calculation you just followed. 

The text is divided into three distinct parts: 

- In the first part we focus on Higgs physics and collider searches. To understand what we are looking for we start 
with the most minimalistic and not renormalizable models describing massive gauge bosons. We then slowly 
advance to the usual fundamental Higgs scalar we are really searching for At the end of this part what 
everybody should understand is the usual set of ATLAS or CMS graphs shown in Figure |6] where many colored 
lines represent different search channels and their discovery potential. Many QCD issues affecting Higgs 
searches we will skip in the Higgs part and postpone to the... 

- ...QCD part. Here, we are taking at least one step back and study the theory which describes Higgs production 
and everything else at the LHC. Two core discussions shape this part: first, we derive the DGLAP equation by 
constructing the splitting kernels. This leads us to the parton shower and to the physical interpretation of 
resumming different logarithms in the QCD perturbation series. Second, there are two modern approaches 
combining parton shower and matrix element descriptions of jet radiation, which we introduce at least on the 
level of simplified models. Throughout the QCD discussion we avoid the more historically interesting deep 
inelastic scattering process and instead rely on the Drell-Yan process or its inverted R ratio process for 
motivation and illustration. 

- Finally, there are many aspects of LHC physics we need to take into account simulating LHC events. Some of 
them, like old fashioned and fat jets, helicity amplitudes, or missing transverse energy we cover in the third part. 
This part will expand in the online version over the coming years while I will keep these lecture notes up to date 
with actual discussions of LHC data. 

What is almost entirely missing in this review is an introduction to searches for new physics completing the Standard 
Model of particle physics beyond the weak scale. Covering this topic appropriately would at least double the length of 
these notes. For the structure of such models and their signatures I instead refer to our review article Ref.lT] and in 
particular to its second chapter. 

Last, but not least, the literature listed at the end of each part is not meant to cite original or relevant research papers. 
Instead, I collected a set of review papers or advanced lecture notes supplementing these lecture notes in different 
directions. Going through some of these mostly introductory papers will be instructive and fun once the basics have 
been covered by these lecture notes. 



As the author of these notes I am confident that they are far from bug free. So if you think you read them very 
carefully and you did not email me at least a handful typos and complaints about poor explanations, you did not read 
them carefully enough. 
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1 HIGGS PHYSICS 



1 Higgs physics 

Understanding the nature of electroweak symmetry breaking — or slightly more specifically deciphering the Higgs 
mechanism — is the main goal of the ATLAS and CMS experiments at the LHC. Observing some kind of Higgs 
boson and studying its properties involves many experimental and theoretical issues focused around understanding 
hadron collider data and QCD predictions to unprecedented precision. The latter will be the main topic of the second 
half of this lecture. 

On the other hand, before we discuss the details of Higgs signatures, backgrounds, and related QCD aspects we 
should start with a discussion of electroweak symmetry breaking. Higgs physics at the LHC might well mean much 
more than just finding a light fundamental Higgs boson as predicted by the Standard Model of particle physics. 
Therefore, we prefer to follow an effective theory approach. This means we do not start by writing down the Higgs 
potential and deriving the measured gauge boson and fermion masses. Instead, we step by step include gauge boson 
and fermion masses in our gauge theories, see what this means for the field content, and show how we can embed this 
mechanism in a renormalizable fundamental gauge theory. Only this last step will lead us to the Standard Model and 
the Higgs potential. 



1.1 Electroweak symmetry breaking 

As a starting point, let us sketch the Standard Model Lagrangian and what we need to do to accommodate mass terms 
for gauge bosons and fermions. As a matter of fact, in a first step in Section [T.Ll| we will try to make our gauge 
bosons massive without introducing a physical Higgs field. Even for the SU{2) gauge theory of the electroweak 



Standard Model we might get away without a fundamental Higgs boson, as we will show in Section L L2 Only when 
we worry about quantum fluctuations of the relevant degrees of freedom we are led to the usual picture of the Higgs 
potential, the Higgs boson, and the symmetries related to its implementation. 



1.1.1 Massive photon 

Even though this is not the physical problem we are interested in, we start by breaking electrodynamics, i.e. giving a 
mass to the (massless) photon of our usual locally U (l)Q-symmetric Lagrangian. To its kinetic F ■ F term we would 
like to add a photon mass which we know is forbidden by the gauge symmetry. The way around is also 

adding an innocent looking real (uncharged) scalar field without a mass and without a coupling to the photon, but with 
a scalar-photon mixing term and a non-trivial gauge transformation. The result is called the Boulware-Gilbert model 

= -\F^.F'^'^+^-e^f(^A,,~jjd^^^ , (1.1) 

where / is a common mass scale for the photon mass and the mixing. It ensures that all terms in the Lagrangian have 
mass dimension four — remembering that bosonic fields like and </> have mass dimension one. The additional 
factor e will become the usual electric charge, but at this stage it is a dimensionless number without any specific 
relevance in this interaction-less Lagrangian. Because all terms in Eq.( |Ll[ i have mass dimension four our theory 
should be renormalizable. 

We can define a simultaneous gauge transformation of both fields in the Lagrangian 

A^^ A^ + ^d^X ^^<t> + x, (1-2) 

under which the Lagrangian is indeed invariant: the kinetic term for the photon we leave untouched, so it will be 
gauge invariant just as it was before. The simultaneous gauge invariance is then defined to keep the second term in 
Eq.(|Ll|i invariant. If we now re-define the photon field as = A^ — d^(j)/ (e/) we need to compare the new and the 
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old kinetic terms 



F„, 



and then rewrite the Lagrangian of Eq.( |l.l| i as 



\f,,F'^^ + \nilBl 



(1.3) 



(1.4) 



This Lagrangian effectively describes a massive photon field B^, which has absorbed the real scalar (f) as its additional 
longitudinal component. This is because a massless gauge boson A^ has only two on-shell degrees of freedom, 
namely a left handed and a right handed polarization, while the massive B^ has an additional longitudinal polarization 
degree of freedom. Without any fundamental Higgs boson appearing, the massive photon has 'eaten' the real scalar 
field (p. Of course, the new field B^ is not simply a photon with a mass term, because this is still forbidden by gauge 
invariance. Our way out is to split the massive photon field into the transverse degrees of freedom and the 
longitudinal mode </> with their different gauge transformations given by Eq.( 1.2 1. 



What kind of properties does this field (j) need to have, so that we can use it to provide a photon mass? From the 
combined gauge transformation in Eq.( |1.2[ ) we immediately see that any additional purely scalar term in the 
Lagrangian, like a scalar potential V{4)), needs to be symmetric under the linear shift + X' "ot to spoil gauge 
invariance. This means that we cannot write down polynomial terms 0", like a mass or a self coupling of 0. An 
interaction term (f)AA would not be possible, either Only derivative interactions proportional to d(f) coupling to 
conserved currents are allowed. In that case we can absorb the shift by x into a total derivative in the Lagrangian. 



This example illustrates a few vital properties of Nambu-Goldstone bosons (NGB). Such massless physical states 
appear in many areas of physics and are described by Goldstone's theorem : 

If a global symmetry group is spontaneously broken into a group of lower rank, its broken generators correspond to 
physical Goldstone modes. These fields transform non-linearly under the larger and linearly under the smaller group. 
They are massless and cannot form a potential, because the non-linear transformation only allows derivative terms in 
the Lagrangian. If the smaller symmetry is also broken, the NGBs become pseudo-NGBs and acquire a mass of the 
size of this hard-breaking term. 

In the special case that the spontaneously broken symmetry is a gauge symmetry and its breaking induces gauge boson 
masses, these massive degrees of freedom are made out of 'eaten' Goldstone modes. The gauge boson mass is given 
by the vacuum expectation value (VEV) breaking the larger symmetry. 



1.1.2 Standard Model doublets 

One of the complications of the Standard Model is its SU (2) doublet structure. In the last section we have chosen not 
to introduce a charged SU{2) doublet, which is why there are no degrees of freedom left after the photon gets its 
mass. This means that our toy model is not going to be well suited to provide the three degrees of freedom needed to 
make SU{2) gauge bosons massive. What it illustrates is only how by introducing a neutral scalar particle without an 
interaction but with a mixing term we make gauge bosons heavy, in spite of gauge invariance. 

Fermion fields have mass dimension 3/2, so we know how mass and interaction terms in the renormalizable 
dimension-4 Lagrangian have to look. For example, the interaction of fermions with gauge bosons is most easily 
written in terms of covariant derivatives . The terms 

^D4 = QLiVQL + Qr^VQb. + LlWLl + LriDLr - (1.5) 

describe electromagnetic interactions introducing a covariant derivative = 9^ + leqA^ with the photon field also 
appearing in the field strength tensor F^,^ = d^Ai, — d^A^. The same form works for the weak SU{2) interactions. 
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except that the weak interaction knows about the chirality of the fermion fields, so we have to distinguish E>l and Dr. 
The covariant derivatives we write in terms of the SU{2) basis matrices ti^2,3 or t+___3, with t± = (ri ± 1x2) /I. 



=d^ + ieqAf^ 



2 



B„ 



T3 



D 



RlJ.'- 



T+ = 



Tl = 



1 



1 

1 



T2 





1 

-i 

1 



'72 



T3 



1 

-1 



(1.6) 



The fields and VT" are the new gauge bosons. In the second line we re-write the covariant derivative in the mass 
eigenstates, i.e. the photon and the Z boson. At this level the two weak couplings g and gz do not necessarily 



coincide, but we will get back to this issue in Section 1.1.5 



Before we generalize the Boulware-Gilbert model to the weak gauge symmetry of the Standard Model it is instructive 
to review the form of the mass term for massive gauge bosons following from Eq.( |1.6| l. In particular, there will appear 
a relative factor two between the two bases of the Pauli matrices, i.e. in terms of W^''^ and W^, which often causes 
confusion. 



Products of the Pauli matrices Ti.2,3 satisfy the relation TiTj 
[tz, Tj] = 2ieijkTk. Summing over indices we see that 



i^ijkTk or the commutator relation 



ijk 



(1.7) 



The basis of three Pauli matrices we can write in terms of ri,2,3 as well as in terms of t-|_ - .3. The latter form of the 
generators corresponds to the two charged and one neutral vector bosons. While the usual basis is written in terms of 
complex numbers, the second set of generators reflects the fact that for SU{2) as for any SU {N) group we can find a 
set of real generators of the adjoint representation. When we switch between the two bases we only have to make sure 
we get the factors \/ 2 right 



V-2{r^W;^+r^W-)=V-2{ ° 



1 



V2 





w- 



wl 
K 



w;, 



(1.8) 



To track these numbers of two in the definitions of the weak gauge field we have a close look at is the dimension-2 
mass term for charged and neutral gauge bosons 



(1.9) 



The relative factor two in front of the W mass corresponds to the factors 1 / \/2 in the SU (2) generators t± . 
Of course, in the complete Standard Model Lagrangian there are many additional terms, e.g. kinetic terms of all 
kinds, but they do not affect our discussion of U{1)y and SU{2)l gauge invariance. 

Guessing the form of the fermion masses the one thing we have to ensure is that we combine the left handed and right 
handed doublets (Ql, i^) and singlets {Qr, Lr) properly: 



(1.10) 
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Following our labeling scheme by mass dimension fermion masses will be included as J/fos- Dirac mass terms simply 
link SU{2) doublet fields for leptons and quarks with right handed singlets and give mass to all fermions in the 
Standard Model. This helicity structure of mass terms we can easily derive by introducing left handed and right 
handed projectors 

1 - 75 , _ ^ , , a + 75 



IpL = 



(1.11) 



2 " 2 
where ijj is a generic Dirac spinor and Pl,r. are projectors in this 4x4 Dirac space. At this stage we do not need the 



explicit form of the gamma matrices which we will introduce in Eq.(2.105 1. The mass term for a Dirac fermion reads 



= ^ (Pi 

= (PflV)^7°(PL^) + (PLV')^7°(Pi?V') 

= (p^)]l(piv) + (pz?)l(pi?^) 



with 
with 
with 



7/> ^ V^7° 
{75, 7p} = 



4 



75,1 



L,R = '^L,R 



(1.12) 



while the kinetic term stays diagonal 



(PlV)^(PlV') + i^Rll^m^Ri^) 



(1.13) 



The link between the chiral projectors and coupling structures we discuss in Section pT] 

In general, these mass terms can be matrices in generation space, which implies that we might have to rotate the 
fermion fields from an interaction basis into the mass basis where these mass matrices are diagonal. Flavor physics 
dealing with such 3x3 mass matrices is its own field of physics with its own reviews and lecture notes, so we will 
omit this compUcation here. For our discussion of electroweak symmetry breaking it is sufficient to study one fermion 
generation at a time. 



The well known problem with the mass terms in Eq.( |1.10| i is that they are not gauge invariant. To understand this 
issue of fermion masses we check the local weak SU (2)^ transformation 



U{x) = exp (^—ia'^{x)^ 



-i(a-T)/2 



(1.14) 



which only transforms the left handed fermion fields and leaves the right handed fields untouched 



Ll ^ ULl 

u 



Lr^ L 



R 



Ql ^ UQl 
Qr ^ Qr ■ 



(1.15) 



It is obvious that there is no way we can make left-right mixing fermion mass terms as shown in Eq.( 1.10 1 invariant 
under this left handed SU{2) gauge transformation, where one of the fermion field picks up a factor U and the other is 
unchanged, 

Ql'^qQr ^ QLU^^mqQR ^ Ql^qQr ■ (1.16) 

In analogy to the massive photon case, to write a gauge-invariant Lagrangian for massive fermions we have to add 
something else to our minimal Standard Model Lagrangian. Note that this addition does not have to be a fundamental 
scalar Higgs field, dependent on how picky we are with the properties of our new Lagrangian beyond the issue of 
gauge invariance. 
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To see what we need to add let us also look at the local U{1) transformations involved. We start with a slightly 
complicated-looking way of writing the abelian hypercharge U{1)y and electric charge C/(1)q transformations, 
making it more obvious how they mix with the neutral component of SU{2) to give the electric charge. 
Let us start with the neutral component of the SU{2) transformation V — exp {if3T^/2). Acting on a field with an 
SU{2) charge this it not a usual U{1) transformation. What we can do is combine it with another, appropriately 
chosen transformation. This hypercharge transformation is proportional to the unit matrix and hence commutes with 
all other matrices 



exp(j/3g) = exp{i/3q) exp ^-^/?t"3 



exp ( 1/3 1 exp \~^PT3 



with 



exp ( i-yt 



(1.17) 



This relation between the charges is called the Gell-Mann-Nishijima formula. Acting on a left handed field the factor 
T3 above is replaced by its eigenvalue ±1 for up-type and down-type fermions. The U{1)y charges or quantum 
numbers y are the quark and lepton hypercharges of the Standard Model. As required by the above argument, properly 
combined with the isospin they give the correct electric charges Qq^l- Since r3 and the unit matrix commute with each 
other the combined exponentials have not additional factor a la Baker-Campbell-Hausdorff e^e^ = e'^+^e^'^'^l/^. 
In analogy to Eq.( |1.15| left handed and right handed quark and lepton fields transform under this U{1)y symmetry as 



Ll exp{i(3qL)V^LL = exp (^^^yL^ ] Ll 
Lr exp (iPqL) Lji 



Ql -> exp ii^QQ) V^Ql = exp ( i^yqt ) Ql 



}r exp (i 



(1.18) 



Under a combined SU{2)l and U transformation the left handed fermions see the hypercharge, while the right 
handed fermions only see the electric charge. Just as for the SU (2) transformation U we do not even have to compute 
anything to see that such different transformations of the left handed and right handed fermion fields do not allow for a 
Dirac mass term. 



1.1.3 Sigma model 

One way of solving this problem which at this point really looks like a cheap trick is to introduce an additional field 

This field will in some way play the role of the real scalar field we used for the photon mass generation. Its 
physical properties will become clear piece by piece from the way it appears in the Lagrangian and from the required 
gauge invariance. The equation of motion for the S field will also have to follow from the way we introduce it in the 
Lagrangian. 

Following the last section, we first introduce S into the fermion mass term to see what it takes to make this mass term 
gauge invariant under the weak SU (2) transformation 

Ql^tuqQr ^ Q^U'^Y.^'^^mQQR = Q^^mQQn ^ S ^ E^^) = C/E . (1.19) 

If the result should be a dimension-4 Lagrangian the mass dimension of E has to be m'^ = 1. The same we can do for 
the U{1)y transformation V 

QiY^rnqQu exp S^^^mg exp (i^g) Qr 

=Ql^^^^ exp (^-«^yl^ exp {iPq) itiqQr 
=Q^Y.'^''WmQQR 



exp ( i^yl ) always commuting 



(1.20) 
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which together with Eq.(1.19 1 gives us the transformation property we need 



(1.21) 



For any E with this property the part of the Lagrangian has the required U{1)y x SU(2)l symmetry, 
independent of what this field really means. From the way it transforms we see that S is a 2 x 2 matrix with mass 
dimension zero. In other words, including a S field in the fermion mass terms gives a U{1)y and SU{2) ^-invariant 
Lagrangian, without saying anything about possible representations of S in terms of physical fields 



■^D3 = -Qh^^nqQn - LLT.mLLii + h.c. 



(1.22) 



In the second step, we worry about the gauge boson masses, starting with the left handed covariant derivative 



(1.23) 



Instead of deriving the gauge transformation of S let us start with a well-chosen ansatz and work backwards step by 
step, as to check that we indeed arrive at the correct masses. 

First, we consistently require that the covariant derivative acting on the S field in the gauge-symmetric Lagrangian 
reads 

(1.24) 



If we introduce the abbreviations = and T = SraE^ we claim we can write the gauge boson mass terms 



as 



2 2 



(1.25) 



The trace acts on the 2x2 SU (2) matrices. The parameter Ap is conventional and will be the focus of Section 1. 1.5 
What we need to show is that this form is gauge invariant and gives the correct gauge boson masses. 



Another question is what additional terms of mass dimension four we can write down using the dimensionless field S 
and which are gauge invariant. Our first attempt of a building block 



(1.26) 



is forbidden by gauge invariance according to the SU (2) transformation of the sigma field, Eq.( 1.20 1. On the other 



hand, a circularly symmetric trace Tr(E^^E) Tt{VY,^'SV^) = Tr(E^E) changes this into a gauge invariant 
combination, which allows for the additional potential terms, meaning terms with no derivatives 



^s--^Tr(EtE) 



(Tr(EtE))' 



(1.27) 



with properly chosen prefactors p, v, A. This fourth term finalizes our construction of the relevant weak Lagrangian 

— ^_D2 + -%I3 + -^Di + , (1-28) 

organized by mass dimension. 

As rule of thumb we will later notice that once we express this potential in terms of the usual Higgs doublet the 
prefactors will just be p and A. The parameter p and the factor v appearing with every power of E have mass 
dimension one, while A has mass dimension zero. Higher dimensional terms in a dimension-4 Lagrangian are possible 
as long as we limit ourselves to powers of Tr(E^^E). However, they lead to higher powers in v which we will see 
makes them higher-dimensional operators in our complete quantum theory. 
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To check for the correct masses in the Standard Model we start with Tr(I]^^E) and assume it acquires a finite 
(expectation) value after we properly deal with E. The definitely simplest way to achieve this is to assume 



(1.29) 



This choice is called unitary gauge. It looks like a joke to introduce T,{x) = t and ask for gauge invariance. Clearly, a 
constant does not exhibit the correct transformation property under the U and V symmetries, but we can always work 
in a specific gauge and only check the final result for gauge invariance. Even the way the sigma field breaks our gauge 
symmetry we can schematically see from 

S C/EF^ = [/ly^ = C/F^ = 1 , (1.30) 

which requires U ^ V tohe the remaining C/(l) gauge symmetry after including the E field. Certainly, E = 1 gives 
the correct fermion masses in and makes the potential .Sfi; an irrelevant constant. What we need to check is 
which is supposed to reproduce the correct gauge boson masses. Using the covariant derivative from Eq.( |1.24| i acting 
on a constant field we can compute the auxiliary field in unitary gauge 

= {W;r+ + W-T_) - igzZjj- (1.31) 

with = CujW^ — SwB^ and the two coupling constants gz = gjcw and g' = gsw/cw What is not obvious from 
this argument is that we can actually write the ratio g' / g in terms of a rotation angle, which implicitly assumes that we 
can rotate the B and fields into the physical mass-eigenstate photon and Z fields 



^t^\_( \ ( B^ 



(1.32) 



The result in Eq.( |1.31[ ) gives us the first of the two terms in the gauge boson mass Lagrangian 



Tr[y^n = -2 ^W+W-'-TTir+T^) ^^Z,Z^T,{rl) 



= -g'^W+W-'^ - ^^Z^Z^ , (1.33) 

using r| = 0, Tr(r3T±) = 0, Trr^T:^ = 1, and Tr(T|) = Tr 1 = 2. The second mass term in proportional to 
Ap is equally simple in unitary gauge 

T = ErgEt = T3 
^ Tt{TV^) = Tr (^-igzZj-^^ = -igzZ^, 

^ Tr{TV^) Tr(ry^) - -glZ^Z^ . (1.34) 
Inserting both terms into Eq.( |1.25[ l yields the complete gauge boson mass term 

= {-g'w^w-'^ j-Z,Z^^ + Ap"^ {~glz,z^) 
= "^KW + "^Z.Z^ Ap'^Z.Z^ 

2 2 2 2 

= "^W+W-^ + ^ (1 - Ap) Z^Z^ . (1.35) 
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Identifying the masses with the form given in Eq.( 1.9 1 and assuming universality of neutral and charged current 
interactions (Ap = 0) we find 



mw = 



gv 



niz 



gzv Ap=o 5zw _ _gv_ 

2 2 ^iCqn 



(1.36) 



The role of a possible additional and unwanted Z-mass contribution Ap we will discuss in Section 1.1.5 on custodial 
symmetry. Given that we know the heavy gauge boson masses (mw ^ 80 GeV) and the weak coupling (g ^ 0.7) 
from experiment, these relations experimentally tell us w ^ 246 GeV. 



Let us just recapitulate what we did until now — using this E field and its finite constant value S = 1 in unitary gauge 
we have made the fermions and electroweak gauge boson massive. Choosing this constant finite field value for S is 
not the only and not the minimal assumption needed to make the gauge bosons heavy, but it leads to the most compact 
Lagrangian. From the photon mass example, however, we know that there must be more to this mechanism. We 
should for example be able to see the additional degrees of freedom of the longitudinal gauge boson modes if we step 
away from unitary gauge. 



If a finite expectation value of the terms in the potential should be linked to electroweak symmetry breaking and 
the gauge boson masses we can guess that the minimal assumption leading to finite gauge boson masses is 
(Tr(I]^^(a;)S](a;))) 7^ in the vacuum. Every parameterization of S with this property will lead to the same massive 



gauge bosons, so they are all physically equivalent 
choices. In the canonical normalization we write 



as they should be given that they are only different gauge 



1 



(Tr(I]^(.T)S](.T))) = 1 Vx 



which instead of our previous S(a;) = 1 we can also fulfill through 



(1.37) 



(1.38) 



This means that E(x) is a unitary matrix which like any 2x2 unitary matrix can be expressed in terms of the Pauli 
matrices. Even though this solution still forbids fluctuations which in the original condition on the expectation value 
only vanish on average, in contrast to = 1 it allows a non-trivial x dependence. The unitary matrix can be 
parameterized as 



= exp ( — w{x) 

V 



with w(x) — Wa{x)Ta , 



(1.39) 



where w{x) has mass dimension one which is absorbed by the mass scale v. These fields are a set of scalar 
Nambu-Goldstone modes . From the photon mass example for Goldstone's theorem we know that they will become 
the missing degrees of freedom for the three now massive gauge bosons and Z. The normalization scale 1; fixes 
the relevant energy scale of our Lagrangian. Because the relation between I] and w is not linear, this is referred to as a 
non-linear representation of the S field. Using the commutation properties of the Pauli matrices we can expand E as 



-ID - 



2 w2 

« - 1 

-W - —rWaWal 

V Iv^ 

1 



1 i 
6 



WaTaWbU + - —WaTaWbUWcTc + 0{w'^) 



6v^ 



WaWaW + 0{w^) 



using Eq.( |1.7| i 



1 

6v 



^WaWa + 0{w^) ] W . 



(1.40) 



From this expression we can for example read off Feynman rules for the longitudinal gauge fields w, which we will 
use later. 
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Yet another way of parameterizing the unitary field S in terms of the Pauh matrices is the properly normalized 
expression 



1 



1 + 



WaW„ 



1 



-w{x) 



(1.41) 



The different ways of writing the S field in terms of the Pauli matrices cannot have any impact on the physics. 

Before we move on and introduce a physical Higgs boson we briefly discuss different gauge choices and the 
appearance of Goldstone modes. If we break SU{2)l x U (l)y U{1)q we expect three Goldstone bosons which 
become part of the weak gauge bosons and promote those from massless gauge bosons (with two degrees of freedom 
each) to massive gauge bosons (with three degrees of freedom each). This is the point of view of the unitary gauge, in 
which we never see Goldstone modes. 

In the general (R — gauge we can actually see these Goldstone modes appear in the gauge boson propagators 



mi 



1 



—I 



= < 





-mlr 


+ ie 




—i 






- ml 


+ ie 




—i 








+ ie 



n^^ — 



unitary gauge ^ — > oo 



Feynman gauge ^ 
Landau gauge ^ = 



(1.42) 







Obviously, these gauge choices are physically equivalent. However, something has to compensate, for example, for 
the fact that in Feynman gauge the whole Goldstone term vanishes and the polarization stun looks like a massless 
gauge bosons, while in unitary gauge we can see the effect of these modes directly. The key is the Goldstone 
propagator, i.e. additional propagating scalar degrees of freedom 



(1.43) 



for both heavy gauge bosons {V = Z, The Goldstone mass \/^my depends on the gauge: in unitary gauge the 
infinitely heavy Goldstones do not propagate (Ayy (g^) — )• 0), while in Feynman gauge and in Landau gauge we have 
to include them as particles. From the form of the Goldstone propagators we can guess that they will indeed cancel the 
second term of the gauge boson propagators. 

These different gauges have different Feynman rules and Green's functions, even a different particle content, so for a 
given problem one or the other might be the most efficient to use in computations or proofs. For example, the proof of 
renormalizability was first formulated in unitary gauge. Loop calculations might be more efficient in Feynman gauge, 
because of the simplified propagator structure, while many QCD processes benefit from an explicit projection on the 
physical external gluons. Tree-level heUcity amphtudes are usually computed in unitary gauge, etc... 



1.1.4 Higgs boson 

At this stage we have defined a perfectly fine electroweak theory with massive gauge bosons. All we need is a finite 
vacuum expectation value for S, which means this field spontaneously breaks the electroweak symmetry not by 
explicit terms in the Lagrangian but via the vacuum and relying on an unknown origin of this finite VEV. If we are 
interested in physics at or above the electroweak energy scale E ^ v some kind of ultraviolet completion of this S 
model should teU us what the S field's properties as a quantum object are. 

If we consider our E model itself the fundamental theory and promote the E field to a quantum field like all other 
Standard Model fields, we need to allow for quantum fluctuations of Tr(E^^E) around the vacuum value Tr E^S = 2. 
We can parameterize these new degrees of freedom as a real scalar field 




(1.44) 
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as long as this physical field H has a vanishing vacuum expectation value and therefore 

i(Tr(I]tE)) = ^(^l + :^y^=l (1.45) 

The factor in front of the fluctuation term H/v is not fixed until we properly define the physical Higgs field and make 
sure that its kinetic term does not come with an unexpected prefactor different from unity. On the other hand, if we 
assume that the neutral Goldstone mode has the correct normalization, the Higgs field should be added to S such 
that it matches this Goldstone, as we will see later in this section and then in more detail in Section[L2] 
The non-dynamical limit of this Higgs ansatz is indeed our sigma model in unitary gauge S^^S = 1, equivalent to 
H = 0. Interpreting the fluctuations around the non-trivial vacuum as a physical Higgs field is the usual Higgs 
mechanism, named after one of The University of Edinburgh's most famous sons. By the way, in the famous paper 
Peter Higgs' address is wrongly spelled Trait Institute where it should be Tait Institute. Tait is the guy who was hired 
by Edinburgh when James Clerk Maxwell was also applying for positions and the University of Edinburgh decided to 
not hire their arguably most famous son who is now watching over George Street. 



For this new Higgs field defines a potential following the original form Eq.(1.27 1 



2 \ V J 4 ^ 

The dots stand for higher-dimensional terms which might or might not be there. We will have a look at them in 



Section 1.2. 1 Some of them are not forbidden by any symmetry, but they are not realized at tree level in the Standard 



Model either. The minimum of this potential occurs at H = 0. But as mentioned already, we will have the entire 
Section [T. 2. l| on the Higgs potential. 

Let us recall one last time how we got to the Higgs mechanism from a static gauge invariant theory, the S model. 
From an effective field theory point of view we can introduce the Goldstone modes and with them gauge boson 
masses without introducing a fundamental Higgs scalar All we need is the finite vacuum expectation value for S to 
spontaneously break electroweak symmetry. For this symmetry breaking we do not care about quantum fluctuations of 
the E field, i.e. we do not distinguish between the invariant Tr S^E and its expectation value. Any properties of the E 
field as a quantum field are left to the ultraviolet completion, which has to decide for example if S is a fundamental or 
composite field. This way, the Higgs field could just be one step in a ladder built out of effective theories. Such a 
non-fundamental Higgs field is the basis for so-called strongly interacting light Higgs models where the Higgs field is 
a light composite field with a different canonical normalization as compared to a fundamental scalar. 

Counting degrees of freedom we should be able to write E as a complex doublet with four degrees of freedom, three 
of which are eaten Goldstones and one is the fundamental Higgs scalar For example, we can write 

l + ' + (1.47) 

V J V V \ 1^)2 — IWi V + H + tW3 J V 

The last step is just another way to write the 2x2 matrix as a bi-doublet in terms of the two SU{2) doublets 
containing the physical Higgs field and the Goldstone modes for the massive vector bosons W and Z 



1 / V + H — iw3 \ ^ — ^ f — iwi 

^\ W2-i'Wi J ^\ V + H + iw:^ 



(1.48) 



The vacuum expectation value v appearing in the and (j) doublets corresponds to (E) = 1. In this form the 
normalization of the two real scalars wj, and H is indeed the same, so their kinetic terms will be of the same form. 
The over-all factors l/-\/2 in the definition of the doublets are purely conventional and sometimes lead to confusion 
when for some people v — 246 GeV while for others v — 11 A GeV. The latter choice is a little less common but has 
the numerological advantage of v ^ rrit. In the Standard Model the two Higgs doublets (p and give mass to up-type 
and down-type fermions. 
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Apart from problems arising when we ask for higher precision and quantum corrections, the effective sigma model 
clearly breaks down at large enough energies which can excite the fluctuations of the sigma field and for example 
produce a Higgs boson. This is the job of the LHC, which is designed and built to take us into an energy range where 
we can observe the structure of electroweak symmetry breaking beyond the effective theory and the Goldstone modes. 



A last side remark — instead of deriving both relevant doublets for up and down-type fermion masses from one 
physical Higgs field (p and (p we can include two sigma fields in the fermion-mass terms 



■^D3 — —Q l''^Qu^u 



with the isospin projectors (1 ± T3)/2. In the gauge boson mass terms they appear as 



(1.49) 



■^1 



D2 



Tr 



Y(u)y(u)^^ 



Tr 



Y(d)y(d)ii 



Each of the two S fields we can express in the usual representation 



^{U.d) ^ J 



Vu,d 



Vu,d 



From the gauge-boson masses we know that 



which means that the longitudinal vector bosons are 

w = cos/3 w^"' + sin (3 w^'^^ 



V sin P and Vd = v cos /3 



(1.50) 



(1.51) 



(1.52) 



(1.53) 



This two-Higgs doublet model is for example the minimal choice in supersymmetric extensions of the Standard 
Model. The ratio of its two vacuum expectation values tan (3 — Vu/vd is one of the most important new physics 
parameters for example in B physics. But type-II two-Higgs doublet models where one Higgs doublet gives mass to 
up-type and another one to down-type fermions are much more general than that. 



1.1.5 Custodial symmetry 



Analyzing the appearance of Ap in Eqs.( 1.25|1.36 1 we will see that not only higher energies, but also higher precision 



leads to a breakdown of the effective sigma model. At some point we start seeing how the relative size of the W and Z 
masses are affected by quantum fluctuations of the sigma field, i.e. the Higgs boson. 



From the construction in Section 1.1.3 we know that electroweak symmetry breaking by a sigma field or Higgs 
doublet links the couplings of neutral and charged currents firmly to the masses of the W and Z bosons. On the other 



hand, the general renormalizable Lagrangian for the gauge boson masses in Eq.( 1.25 i involves two terms, both 
symmetric under SU (2) l x U{1)y and hence allowed in the electroweak Standard Model. The mass values coming 
from Tr[V";iF^] give mw and mz proportional to g = gw and gz. Their relative size can be expressed in terms of the 
weak mixing angle 9^, together with the assumption that Gp or g universally govern charged current (W^) and 
neutral-current (W^) interactions. At tree level this experimentally very well tested relation is 



9l 



cos^ e,„ = cl 



In general, we can introduce a free parameter p which breaks this relation 



9l 



al p 



mz mz ^/p , 



(1.54) 



(1.55) 



which from measurements is very strongly constrained to be unity. In .^d2 the Z-mass term proportional to Ap 
precisely predicts such a deviation p 7^ 1. To bring our Lagrangian into agreement with measurements we better find a 
reason to constrain Ap to zero, and the SU{2) x U (1) gauge symmetry unfortunately does not do the job. 
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Looking ahead, we will find that in the Standard Model p = 1 is actually violated at the one-loop level. This means 
we are looking for an approximate symmetry of the Standard Model. What we can hope for is that this symmetry is at 
least a good symmetry in the SU{2)]^ gauge sector and slightly broken elsewhere. One possibility along those lines is 
to replace the SU (2)^ x U{1)y symmetry with a larger SU (2)^ x SU{2)^ symmetry. At this stage this extended 
symmetry does not have to be a local gauge symmetry, a global version of SU(2)l combined with a global SU{2)ii is 
sufficient. This global symmetry would have to act like 

S ^ [/SV^^ U e SU{2)l V e SU{2)r 

Tr(I]^S) ^ Tr (l/E^t/^C/SF^) = Tr(S^E) . (1.56) 

In this setup, the three components of W^^ form a triplet under SU {2)l and a singlet under SU (2) r. If we cannot 
extract ra as a special generator of SU{2)l and combine it with the U{1) hypercharge the W and Z masses have to be 
identical, corresponding to Cu, = 1 at tree level. 

In the gauge boson and fermion mass terms computed in unitary gauge the S field becomes identical to its vacuum 
expectation value 1. On the symmetry breaking VEV the combined global SU{2) transformations act as 

(S) ^ {UT.V^) = {UIV^) ^ UV^ 1 . (1.57) 

The last step, i.e. the symmetry requirement for the Lagrangian can only be satisfied if we require U — V .\n other 
words, the vacuum expectation value for S breaks SU{2)]^ x SU {2)r to the diagonal subgroup SU{2)i^^r. The 
technical term is precisely defined this way — the two SU (2) symmetries reduce to one remaining symmetry which 
can be written as C/ = T^. Depending on if we look at the global symmetry structure in the unbroken or broken phase 
the custodial symmetry group is either SU{2)fi or SU (2) l+r- Its effect should nevertheless be the same. 



Beyond tree level the global SU (2) l x SU{2)r symmetry structure can protect the relation between the gauge boson 
masses shown in Eq.( 1.54[ i. In other words, the custodial symmetry structure protects the tree-level value of p = 1. 



From Eq.(1.56 1 we immediately see that it allows all terms in the Higgs potential but it changes the picture not 
only for gauge boson but also for fermion masses. If fermions reside in SU (2) l as well as SU (2) r doublets we 
cannot implement any difference between up-type and down-type fermions in the Lagrangian. The custodial 
symmetry is only intact in the limit for example of identical third generation fermion masses mf, — nit. 
The measured masses mt ^ mi, affect the protected tree-level value p — 1: self energy loops in the W propagator 
involve a mixture of the bottom and top quark, while the Z propagator includes pure bottom and top loops. Skipping 
the loop calculation we quote their different contributions to the gauge boson masses as 




(1.58) 

In the Taylor series above the assumption of 5 being small is of course not realistic, but the result is nevertheless 
instructive: the shift vanishes very rapidly towards the chirally symmetric limit mt ^ mi,. The sign of the contribution 
of a chiral fermion doublet to Ap is always positive. In terms of realistic Standard Model mass ratios it scales like 
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Figure 1: Allowed range of Higgs masses in the Standard Model after taking into account electroweak precision data, 
most notably the p parameter contribution from the Higgs itself, Eq.(1.59 1. Figure from the LEP electroweak working 
group. 



We have already argued that hypercharge or electric charge break custodial symmetry. From the form of the covariant 
derivative D^jS including a single T3 we can guess that the SU (2) a symmetry will not allow B field interactions 
which are proportional to s^, ~ ^^1/4. A second contribution to the p parameter therefore arises from Higgs loops in 
the presence of g' ^ 



ApD V " log^- (1-59) 

The sign of this contribution implies that larger Higgs masses give negative contributions to the p parameter. 



There is another parameterization of the same effect, namely the T parameter. It is part of an effective theory 
parameterization of deviations from the tree-level relations between gauge boson masses, mixing angles, and neutral 
and charged current couplings. In general, for example the mass ratio mw/mz depends on all three parameters 
S, T, U. Typical experimental constraints form an ellipse in the S vsT plane along the diagonal. However, for 
example the leading contribution from the different bottom and top masses can be translated into a contribution 
a AT ^ Ap, centered around T = 0. 

There are two reasons to discuss these loop contributions breaking the custodial symmetry in the Standard Model. 
First, Ap is experimentally very strongly constrained by electroweak precision measurements, which means that 
alternative models for electroweak symmetry breaking usually include the same kind of approximate custodial 
symmetry by construction. As a matter of fact, this constraint is responsible for the almost death of (technicolor) 
models which describe the Higgs boson as a bound state under a new QCD-like interaction. 

Even more importantly, in the Standard Model we can measure the symmetry violations from the heavy quarks and 
from the Higgs sector shown in Eqs.( |1.58|1.59| l in electroweak precision measurements. Even though the Higgs 
contributions depend on the Higgs mass only logarithmically, we can then derive an upper bound on the Higgs mass 
of the order of 0(200 GeV), as shown in Figure[T] This strongly suggests that if we are faced (mostly) with the 
Standard Model at the weak scale the Tevatron and at the LHC will be looking for a fairly light Higgs boson — or 
something that very much looks like a light fundamental Higgs boson 

However, we should firmly keep in mind that the p parameter only points to a light fundamental Higgs boson if we 
assume the Standard Model Higgs mechanism. For any other model it might point to something similar to a light 
Higgs field, but does not have to be fundamental. Including additional fields in a model can even turn around this 
prediction and prefer a heavy Higgs state. 



One last question is: how do physical modes which we introduce as = exp(— iw/w) transform under the 
different global SU (2) symmetries which make up the custodial symmetry and can we construct a model of 
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electroweak symmetry breaking around the custodial symmetry? This brings us back to the example of the photon 
mass, where we first saw Goldstone's theorem at work. 

Under the usual SU (2) ^ symmetry we know the transformation reads S — > JJE with U = cxp{—ia ■ t/2). The 
transformation properties of the Goldstone modes w follow from the infinitesimal transformations 



1 



a 



2 

W ■ T 



,W ■ T 



implying 
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w - 


^ w' = w - 


--a 









(1.60) 



so [/ is a non-linear transformation , since w'^ is not proportional to Wa- The same structure we find for the SU{2)j^ 
transformation. This model of electroweak symmetry breaking we call a non-linear sigma model. When we construct 
a Lagrangian this non-linear symmetry transformation forbids mass terms, gauge interactions, Yukawa couplings, and 
quadratic potential terms for these modes in E. As discussed in Section [T.l.l| only derivative terms like the kinetic 
term and derivative couplings are allowed under the SU (2) l and SU (2)^ symmetries. 

Similarly, we can evaluate the transformation of these physical modes under the diagonal symmetry group SU{2)n^i^ 
with E ^- U^W and instead find 
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implying 



which is a linear transformation. In the fourth line we have used the commutator 



(a ■ t){w ■ t) = a ■ w + ir {a X w) 
[{a ■ r), (w ■ r)] — 2iT {a x w) . 



using Eq.( |1.7| l 



(1.61) 



(1.62) 



In other words, when we transform the physical modes corresponding to the broken generators in E by the larger 
symmetry SU (2) l x SU{2)b, we find a non-linear transformation, while the approximate symmetry S'[/(2)l+_r leads 
to a linear transformation. This is precisely what Goldstone's theorem predicts for the spontaneous breaking of a 
global electroweak symmetry. 



1.2 The Standard Model 

Before we discuss all the ways we can look for a Higgs Boson at the LHC we briefly review the Higgs mechanism in 
the Standard Model. In the last sections we have seen that there does not really need to be such a fundamental scalar, 
but electroweak precision data tells us whatever it is the Higgs should look very similar to a light fundamental scalar, 
unless we see some seriously new states and effects around the weak scale. 

To make it a little more interesting and since we are already in the mood of not taking the Standard Model Higgs 
sector too literally, in Section [I.2.1| we include higher dimensional operators on top of the usual renormalizable 
(dimension-4) operators in the Higgs potential. Such operators generally occur in effective theories based on 
ultraviolet completions of our Standard Model, but their effects are often small or assumed to be small. 
Once we want to analyze the behavior of the Higgs sector over a wide range of energy scales, like we will do in 



Sections 1.2.2 and 1.2.3 we need to take the Standard Model seriously and in turn find constraints on the structure and 



the validity of our Standard Model with a fundamental Higgs boson. 
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(1.63) 



1.2.1 Higgs potential to dimension six 

In the renormalizable Standard Model all terms in the Lagrangian are defined to be of mass dimension four, like 
mfipip or '4)di^i_'tl^ or F^^F^" . This mass dimension is easy to read off if we remember that for example scalar fields or 
vector-boson fields contribute mass dimension one while fermion spinors carry mass dimension 3/2. The same 
renormalizability assumption we usually make for the Higgs potential, even though from the previous discussion it is 
clear that higher dimensional terms — stemming from higher powers of Tr(S^^S) — can and should exist. 
Starting from the Higgs doublets introduced in Eq.( |1.48[ ) and for now ignoring the Goldstone modes the simplified 
doublet 

l^f -W2-IWI \ ^ ^ f 

leaves us with only two renormalizable potential terms in Eq.( 1.46| l, now written in terms of the Higgs doublet and 
with fx^ and A as prefactors 

-■^n^ VsM = A4>? + M^t + const . (1.64) 

To emphasize that renormalizability is a strong and not necessarily very justified theoretical assumption in LHC Higgs 
physics, we allow for more operators in the Higgs potential. If we expand the possible mass dimensions and the 
operator basis, there are exactly two gauge-invariant operators of dimension six we can write down in terms of the 
Higgs doublet i.e. before electroweak symmetry breaking 

O2 = -^(0V)'. (1.65) 

The prefactors in the Lagrangian are conventional, because to construct a Lagrangian we have to multiply these 
operators with general coefficients of mass dimension minus two, parameterized in terms of an unknown mass scale A 

i=l 

As long as the typical energy scale E in the numerator in our matrix element is small (i? <C A), the corrections from 
the additional operators are small as well. 

There is in principle one more possible operator (d^cf))^ {(p^ (j)){d^(p), but it violates the custodial symmetry and leads 
to a very large contribution to the electroweak precision parameter AS* — — f^v^ / . Such large contributions to 
S are firmly ruled out by electroweak precision data, so we ignore this operator in our analysis. 

Before we compute the Higgs potential including O2 we look at the effects of the dimension-6 operator Oi. It 
contributes to the kinetic term the Higgs field in the Lagrangian, before or after symmetry breaking 



2 ^ \ 2 / " \ 2 

1 
2 



{H + vf d^,H d^'H . (1.67) 



We use the symbol H for the Higgs field as part of (f), because from the formula above it is likely that there will be a 
difference between H and the physical Higgs field H at the end of the day. The contribution from Oi leaves us with a 
combined kinetic term 



H = + ij^H = N H 



(1.68) 



This is a simple rescaling to define the canonical kinetic term in the Lagrangian, but it also means we have to make 
sure we replace H with H in the entire Higgs sector. 
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The mass of the Higgs boson will be given by all quadratic terms from the potential. The regular dimension-4 
contributions in terms of the shifted Higgs field H are 

.'sM = .^<^ + A(^ = f(H^.-) + ^(-..eHV-.). 
The terms in the parentheses will later contribute to the Higgs mass in terms of /i, v and A. 



Including the additional dimension-6 operator O2 we can write the Higgs potential as 



y = /i^i(/>i^+Ai0r+ 



3A2' 



(1.70) 



The positive sign in the last term of the potential V ensures that for /2 > the potential is bounded from below for 
large field values 0. The non-trivial minimum at 7^ is given by 
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(1.71) 



defining the minimum position |0| — v 12. The two solutions of the quadratic equation for v /2 are 
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(1.72) 



The first solution we have expanded around the Standard Model minimum, Wq = —ji^/X. The second, high-scale 

solution is not the vacuum relevant for our Standard Model. Note that from the VF, Z masses we know that 

V = 246 GeV so v is really our first observable in the Higgs sector, sensitive to the higher-dimensional operators. 



Including this additional potential operator in terms of H reads 
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1 [H + vf 
3 



8 



1 

24 



3„.3 



r2„,4 



(1.73) 



which after combining with the dimension-4 contributions quadratic in the Higgs field shown in Eq.( 1.69 1 gives us the 
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complete mass term to dimension six 
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replacing fi using Eq.( |1.72 1 
replacing H using Eq, 



1.681 



At^ 
2 



1 - 



A2 

A2 



2A2A 

,2 



m 
~2 



Hjj2 



2A2A/ 



(1.74) 



Including dimension-6 operators the relation between the vacuum expectation value, the Higgs mass and the factor in 
front of the \(t>\'^ term in the potential changes. Once we measure the Higgs mass at the LHC, we can compute the 
trilinear and quadrilinear Higgs self couplings by collecting the right powers of H in the Higgs potential, in complete 
analogy to the Higgs mass above. We find 
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(1.75) 



This gives the Feynman rules 



.3m2^ 




(1.76) 



and 



1 - 



A2 A2r 



3A2r 



(1.77) 



From this discussion we see that in the Higgs sector the Higgs self couplings as well as the Higgs mass can be 
computed from the Higgs potential and depend on the operators we take into account. As mentioned before, in the 
Standard Model we use only the dimension-4 operators which appear in the renormaUzable Lagrangian and which 
give us for the Higgs mass and self couplings 

= 2Au^ = -2/i^ 

with V = Vf) ^ 246 GeV. When the Higgs sector becomes more complicated, not the existence but the form of such 
relations between masses and couplings will change. 

With this information we could now start computing Higgs observables at the LHC, but let us first see what else we 
can say about the Higgs potential from a theoretical point of view. 



and ^,^„ = _!^ff3_^^4^ (178) 
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1.2.2 Unitarity 

If we want to compute transition amplitudes at very high energies the Goldstone modes become very useful. In the V 
rest frame we can write the three polarization vectors of a massive gauge boson as 


















(D 














(1.79) 

If we boost V into the z direction, giving it a four-momentum — [E, 0, 0, \p\), the polarization vectors become 

■^T,! - I n I - I 1 I ^L- ZT^ 1 n I — ^ I n I = z:^^ • (1.80) 



Very relativistic gauge bosons are dominated by their longitudinal polarization |e^| ^ E/my ^ 1. This longitudinal 
degree of freedom is precisely the Goldstone boson. This means that at high energies we can approximate the 
complicated vector bosons Z, as scalar Goldstone bosons wq, w±. This comes in handy for example when we 
next talk about unitarity as a constraint on the Higgs sector. This relation between Goldstones and gauge bosons at 
very high energies is called the equivalence theorem. 

Based on the equivalence theorem we can compute the amplitude for W^W~ W^W~ scattering at very high 
energies {E ^ mw) in terms of scalar Goldstones bosons. Three diagrams contribute to this processes: a four-point 
vertex, the s-channel Higgs exchange and the f-channel Higgs exchange: 

\ / \ / ^ 

\ / \ / 

/ N + / \ + I 

/ \ / \ I 

/ \ / \ 

/ \ / \ ^ 

For that we need some basic Feynman rules, for example the Goldstone couphngs to the Higgs boson and the 
four-Goldstone coupUngs. We start with the Higgs doublet, now including the Goldstone modes 



^hYwI + 0{w°), (1.81) 



2 1 



where in the last step we neglect aU terms without the Goldstone fields. For the potential we then get, only keeping the 
relevant terms at dimension four 



2 
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Focussing on the scattering of charged Goldstones w±w± — > w±w± we use the corresponding fields 



w± — {wi ± iw2)/V2. They appear in the above expression as 



2w+W-, so we find the terms 



TTl l, , , n 



2v 



H 2 w+W- 
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2v^ 



(1.83) 



which fix the two Feynman rules we need. Linking the Lagrangian to a Feynman rule involves one complication: for 
each positively charged Goldstone in the vertex there are two ways we can identify them with the Lagrangian fields. 
In addition, there are also two choices to identify the two negatively charged Goldstones, which implies an additional 
combinatorial factor four in the Feynman rule. Including a common factor (— i) the two Feynman rules then become 
—2im?jj/v'^ and —irnj^/v. 

The amplitude is given in terms of the Mandelstam variables s and t which describe the momentum flow through 



the two Higgs propagators and which we will properly introduce in Section 2. LI 
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(1.84) 



For this process we want to test the unitarity of the S matrix, which we write in terms of a transition amplitude 
S" = 1 + iA. The S matrix should be unitary to conserve probability 

11 = 5^5 = (1 = 1 A^) + ^ AU = -i(A-A^). (1.85) 

If we sandwich {A — A"^) between identical asymptotically free fields, which means that we are looking at forward 
scattering with a scattering angle 6* — > 0, we find in the high-energy limit or for massless external particles 
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(1.86) 



Assuming that our Lagrangian is hermitian this imaginary part corresponds only to absorptive terms in the scattering 
amplitude. This is the usual formulation of the optical theorem reflecting unitarity in terms of the transition amplitude 
A. 



To include the dependence on the scattering angle 9 we decompose the transition amplitude into partial waves 

A= 16^^(2^ + 1) (cos 0) a, with / dxPi{x)Pi.{x)^ ——6w , (1.87) 

ordered by the orbital angular momentum /. Pi are the Legendre polynomials of the scattering angle 9, which obey an 
orthogonaUty condition. The scattering cross section including all prefactors and the phase space integration is then 
given by 
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The relation between the integral over the scattering angle 9 and the Mandelstam variable t we will discuss in more 
detail in Section[2.1.1| Applied to this perturbative series in partial waves the optical theorem requires 



— (21 + 1) \aif = ^ 167r(2Z + 1) Ima, 

s s 



Imai . 



(1.89) 



using Pi (cos 9 — 1) — 1. This condition we can rewrite as 
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|Rea/| < - 



(1.90) 



once we recognize that the condition on Im a/ and on Re a; is a unit cycle around a; — (0, 1/2) with radius 1/2. 



It is important to remember that in the above argument we have formulated the constraint for each term in the sum 
over the Legendre polynomials. Mathematically, this is well justified, but of course there might be physics effects 
which lead to a systematic cancellation between different terms. This is why the constraint we compute is referred to 
as perturbative unitary. For Goldstone scattering we compute the supposedly leading first term in the partial wave 
expansion from the amplitude 
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In the high-energy limit s ^ m?^ this translates into an upper limit on the Higgs mass which in Eq.( 1.84 1 enters as the 
Goldstone coupling in the numerator 



87rw2 



< 



(870 GeV)^ 



(1.92) 



This is the maximum value of iriH maintaining perturbative unitarity for WW — > T4^T4^ scattering. Replacing the 
Higgs mass by the self coupling we can formulate the same constraint as A < 2tt. In this form we see that perturbative 
unitarity goes right to the limitations of perturbation theory, which means that we should include higher order effects 
as well as higher dimensional operators to get a reliable numerical prediction in the range of < 1 TeV. In the 
following section we will have another less perturbatively limited look at the behavior of large A. 
Of course, if we limit s to a finite value this bound changes, and we can compute a maximum scale Smax which leaves 
WW — >■ WW perturbatively unitary for fixed rriH' for itih ^ v this typically becomes ^/s^ ^1.2 TeV. This 
number is one of the motivations to build the LHC as a high energy collider with a partonic center-of-mass energy in 
the few-TeV range. If something should go wrong with the Standard Model Higgs sector we can expect to see 
something else curing unitarity around the TeV scale. 



One last but very important comment we need to make: this unitarity argument only works if the WWH coupling is 
exactly what it should be. While perturbative unitarity only gives us a fairly rough upper limit on mn, it also uniquely 
fixes gwwH to its Standard Model value. Any sizeable deviation from this value again means new physics appearing 
at the latest around the mass scales of Eq.( 1 .92 1. 

Looking at processes like WW — > // or WW — ?► WWH or WW — > HHH we can exactly the same way fix 
all Higgs couplings in the Standard Model, including gnff, dHHH, gHHHH- The most important result of the 
unitarity test is probably not the upper bound on the Higgs mass, but the underlying assumption that the unitarity test 
only works in the presence of one Higgs boson if all Higgs couplings look exactly as predicted by the Standard Model. 
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1.2.3 Renormalization group analysis 



Two additional theoretical constraints we can derive from the renormalization group equation of the Higgs potential, 
specifically from the renormalization scale dependence of the self coupling A(Q^). Such a scale dependence arises 
automatically when we encounter ultraviolet divergences and absorb the 1/e poles into a minimal counter term. We 



will discuss this running of couplings in more detail in Section 2.2.1 for the running QCD coupling Ug. In the case of 



a running quartic Higgs coupling A the relevant s, t and w-channel diagrams only depending on A itself are 
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Skipping the calculation we quote the complete renormalization group equation including diagrams with the Higgs 
boson, the top quark and the weak gauge bosons inside the loops 
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(1.93) 



with Af = \/2mt/v. This formula will be the basis of the discussion in this section. 



The first regime we study is where the Higgs self coupling A becomes strong. Fixed order perturbation theory as we 
use it in the unitarity argument runs into problems in this regime and the renormalization group equation is the 



appropriate tool to describe this region. The leading term in Eq.(1.93 i reads 

Because of the positive sign on the right hand side the quartic coupling will become stronger and stronger and 
eventually diverge for larger scales Obviously, this divergence should not happen in a physical model and will 
give us a constraint on the maximum value of A allowed. The approximate renormalization group equation we can 
solve by replacing X — g^^ 
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A boundary condition X{Q^ 
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Starting from scales Q ^ v where the expression in brackets is close to one and moving towards larger scales the 
denominator becomes smaller until A actually hits a pole at the critical value Qpoie 

/n2 



1 



3 . , ^pole ! „ 

— Aolog— = 



3 

4^ 



Ao log 



log 



'Spole 


= 1 


^2 


Qpole 


47r2 




" 3a;^ 


Qpole 


— V exp 



2^ 
3Ao 



V exp 



47rV 



(1.97) 



1.2 The Standard Model 



27 




A [GeV] 



Figure 2: Triviality or Landau pole (upper) and stability bounds (lower) for the Standard Model Higgs boson in the 
nT-H — Q plane. Similar figures first appeared in Ref. |2|, involving this kind of physics argument from Ref. |3|. 



Such a pole is called a Landau pole and gives us a maximum scale beyond which we cannot rely on our perturbative 
theory to work. In the upper line of Figure |2] we show Qpoie versus the Higgs mass, approximately computed in 
Eq.( L97 1. As a function of the Higgs mass Qpoie gives the maximum scale were our theory is valid, which means we 
have to reside below the upper line in Figure |2] Turning the argument around, for given Qpoie we can read off the 
maximum allowed Higgs mass which in the limit of large cutoff values around the Planck scale 10^^ GeV becomes 
ruH < ISO GeV. 

This limit is often referred to as the triviality bound, which at first glance is precisely not what this theory is — trivial 
or non-interacting. The name originates from the fact that if we want our Higgs potential to be perturbative at all 
scales, the coupling A can only be zero everywhere. Any finite coupling will hit a Landau pole at some scale. Such a 
theory with zero interaction is called trivial. 



After looking at the ultraviolet regime we can go back to the full renormalization group equation Eq.( L93 1 and ask a 
completely different question: how long will A > ensure that our Higgs potential is bounded from below? 



This bound is called the stability bound. On the right hand side of Eq.( L93 i there are two terms with a negative sign 
which in principle drive A through zero. One of them vanishes for small A ^ 0, so we can neglect it. In the small-A 
regime we therefore encounter two finite competing terms 
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The usual boundary condition at X{v'^) = m'jj/{2v'^) is the starting point from which the top Yukawa coupling drives 
A to zero at another critical scale A((55tabie) — ^ which depends on the Higgs mass niH- The second (smaller) 
contribution from the weak gauge coupling ameliorates this behavior 
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From Eq.( 1.98 1 we see that only for energy scales below (5stabie("iff ) the Higgs potential is bounded from below and 
our vacuum stable. For a given maximum validity scale Qstabie this stability bound translates into a minimum Higgs 
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Figure 3: Branching ratios of the Standard-Model Higgs boson as a function of its mass, computed with HDECAY. 
Off-shell effects in the decays to WW and ZZ are taken into account. Figure found for example in Refs.||4]|5l. 



mass balancing the negative slope in Eq.( 1.98 i for which our theory is then well defined. In Figure|2]we show Qstabie 



as the lower curve, above which our consistent theory has to reside. 

Summarizing what we know about the Higgs mass in the Standard Model we already have indirect experimental as 
well as theory constraints on this otherwise undetermined parameter 

Strictly in the Standard Model, electroweak precision data points to the mass range mn ^ 200 GeV. This means at the 
LHC we are either looking for a light Higgs boson or we should expect some drastic modifications to our Standard 
Model which could alter this picture significantly and would be hard to miss and even more exciting to observe. 
From the renormalization group we have two pieces of information on the Higgs mass, again in the renormalizable 
Standard model: the Landau pole or triviality bound gives an upper limit on inn as a function of the cutoff scale. 
Vacuum stability gives a lower bound on mn as a function of the cutoff scale. Running both cutoff scales towards the 
Planck mass Qpoie, Qstabie 10^^ GeV, we see in Figure|2]that only Higgs mass values around 
niH = 130 • • • 180 GeV are allowed for a fundamental Standard Model. 



1.3 Higgs decays 

Signatures for new particles at colliders consist of a production process and a decay pattern. Both, the production and 
the decay can contribute to unique kinematic features which we can use to extract signal from background events. The 
actual new particle is then described by a Breit-Wigner propagator for unstable particles which we will discuss in 
detail in Section 2.1.2 Since the Higgs boson is a scalar there are no correlations between production and decay 
process, which simplifies the calculation and simulation of Higgs signatures. 



Unlike the production processes the Higgs decay pattern is as simple as it can be. At tree level all decay rates are 
determined by the Higgs coupling to Standard Model particles, which are fixed by unitarity. The rule for different 
Higgs decays is simple; because by definition the Higgs field couples to all particles (including itself) proportional to 
their masses it will preferably decay to the heaviest states allowed by phase space. This goes back to the condition 
(S) = 1 translated into the exclusive appearance of the combination {v + H) in the Higgs field (j) and in the 
Lagrangian. 

This behavior we see in Figure |3] Starting at low masses this first applies to decays to tt and hb. The relative size of 
their branching ratios around 10% : 90% is given by their masses or Yukawa couplings, evaluated in the appropriate 
renormalization scheme at the right scale. Once the (off-shell) decays to WW open, they very soon dominate. The 
dominant decays to bottom pairs and W pairs become equal for Higgs masses around 130 GeV. 
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Because of the small mass difference between the W and Z bosons the decay to ZZ is not as dominant, compared to 
the WW decay which has two degrees of freedom (W'^W^ and W^'W'^) in the final state. In particular in the region 
where the W decays first becomes on-shell we see a drop of in the still off-shell Z decays. For large Higgs masses the 
ratio of H WW and H — > ZZ decays is fixed by the relative factor of two, corresponding to the number of degrees 
of freedom forming the final state. Above the top threshold the tt decay becomes sizeable, but never really dominant. 

Because the bottom Yukawa coupling is so small, mi, /mw < 1 /30, one-loop induced couplings can almost compete 
with it. In particular the loop-induced decay to two photons plays an important role in LHC phenomenology. It 
proceeds via a top and a W triangle which enter with opposite signs in the amplitude, i.e. interfering destructively. 
The larger W contribution fixes the sign of the loop-induced coupling. 

The structure of the ^^H coupling is similar to the production via the loop-induced ggH coupling which we will 



discuss in Section 1.4 The reason for considering this decay channel are the LHC detectors. To extract a Higgs signal 
from the backgrounds we usually try to measure the four-momenta of the Higgs decay products and reconstruct their 
invariant mass. The signal should then peak around niH while the backgrounds we expect to be more or less flat. The 
LHC detectors, in particular CMS, are designed to measure the photon momentum and energy particularly well. The 
resolution on m^^ will at least be a factor of 10 better than for any other decay channel, except for muons. Moreover, 
photons do not decay, so we can use all photon events in the Higgs search, while for example hadronically decaying 
W/Z 2 jets are not particularly useful at the LHC. These enhancement factors make the Higgs decay to two 
photons a promising signature, in spite of its small branching ratio around 2 • 10 



-3 



Because an observed Higgs sector can deviate from the minimal Standard Model assumptions in many ways the LHC 
or other future colliders will study the different Higgs decays and, as a function of ran, answer the questions 

- are gauge-boson couphngs proportional to mw,z'^- 

- are these couplings dunension-3 operators? 

- are fermion Yukawa couplings proportional to nifl 

- is there a Higgs self coupling, i.e. a remnant of the Higgs potential? 

- do Xhhh and Xhhhh show signs of higher-dimensional operators? 

- are there any other unexpected effects, like a Higgs decay to invisible particles? 
But before we decay the Higgs we need to produce it... 



1.4 Higgs production in gluon fusion 

Looking for the Higgs boson at hadron colliders starts with bad news: at tree level the Higgs hardly couples to 
light-flavor quarks and has no coupling to gluons. This is because the Higgs boson couples to all Standard Model 
particles proportional to their mass — this is the same operator they get their mass from. Because the SU{Z)c 
symmetry of QCD is not broken there is no coupling to gluons at all. 

On the other hand, the protons at the LHC contain a lot of gluons, again something we will talk about in more detail in 
Section|2] so the question is if we can find and use a loop-induced coupling of two gluons to the Higgs. In spite of the 
expected suppression of the corresponding cross section by a one-loop factor (g^/ (IGtt'^))^ we would hope to arrive at 
an observable production cross section pp — > H. Numerically, it will turn out that the production of Higgs bosons in 
gluon fusion is actually the dominant process at the LHC, as shown in Figure |4] 

1.4.1 Effective gluon-Higgs coupling 

If an effective ggH coupling should be mediated by a closed Standard Model particle loop the top is the perfect 
candidate: on the one hand it has a strong coupling to gluons, and on the other hand it has the largest of all Standard 
Model couplings to the Higgs boson, rrit/v ^ 0.7. The corresponding Feynman diagram is 
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Figure 4: Left; production cross section for a Standard-Model Higgs boson at the LHC, as a function of the Higgs 
mass. Figure from Ref. [4J. Right: updated version including higher order corrections. 

k2 




ki 

We construct this effective coupling in three steps, starting with the Dirac trace occurring the top loop. All momenta 
are defined as incoming with — fc| = and — nij^. The Dirac indices of the two gluons are /i, h' and the loop 
momentum is q, so in the first step we need to compute 

T^'' = Tr [(^ + mt) 7'' {4 + h + nit) ^ + h + h + mt)] ■ (1.100) 

The calculational problem is the tensor structure of this trace. Because of gauge invariance we can neglect terms 
proportional to fc^ and fcj ; they would not survive the multiplication with the transverse gluon polarization {k ■ e — 0). 
In a so-called axial gauge we could also get rid of the remaining terms proportional to and fcj ■ 



However, there is a better way to compute this trace. We know that there is no tree-level Higgs coupling of the Higgs 
to two gluons, which would correspond to the fields HG^C^ with mass dimension three in the Lagrangian. So we 
need to find another operator mediating such a coupling, keeping in mind that it is loop induced and can therefore 
include a mass suppression by powers of the top mass. The Higgs can also couple to the field strength in the invariant 
form HG^^G^^^ with G^jy = d^A^ — + 0{A^). This operator has mass dimension five and arises from the 

dimension-6 gauge-invariant object cj)^ cj) G ^^G^'^ after breaking SU{2)l. 

The factor in front of this term is the effective coupling we are going to compute in this section. Before that it pays to 
briefly look at the operator itself. Switching from position space and its momentum operator into momentum space 
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shows that the gauge invariant operator Unking exactly two gluon fields to a Higgs field has to be proportional to the 
transverse tensor 

- ^ (s^^^ - (f^) with pr^Tp. = i p^^fci^ = = . (1.102) 



Using this known tensor structure of T^'' as defined in Eq.( 1 . 100 1 we can extract the scalar form factor F from the 
Dirac trace using 

y^i^^^pA;^ ^ P^^^T^"' ^ Prt^uPr F F . (1.103) 

This way we project out the relevant gluon tensor structure or the relevant degrees of freedom of the two gluons 
contributing to this effective coupling. Terms involving a larger number of gluon fields are related to this ggH 
coupling by non-abelian SU{Z) gauge invariance. 

One thing to admit at this stage is that nobody in the world really computes Dirac traces by hand anymore. There are 
powerful programs, like FORM, which do this job for us. Using it we find 
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Pt^..T'''' = ^ {-m% + 3m^,-—{hq){k2q)-2{kiq)+q^) . (1.104) 



Inside this trace there appears the loop momentum q, which we in our second step we have to consider as part of the 
loop integration. The effective ggH vertex includes the loop integral 
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The non-trivial q^ dependence of the numerator observed in Eq.( |1.104 1 we can take care of using a few tricks. For 
example, we use the relation q^ /{q^ — mf) = 1 + mf/{q'^ — ) and then shift q, knowing that the final result for 
this integral will be finite. This is non-trivial piece of information, because most loop calculations lead to ultraviolet 
divergences, which need to be removed by first regularizing the integral and then renormalizing the parameters. The 
reason why we do not see any divergences in this process is that for a renormalization we would need something to 
renormalize, i.e. a leading order process which receives quantum corrections. However, we only compute this 
one-loop amplitude because there is no tree-level vertex, i.e. nothing to renormalize, which means no ultraviolet 
divergences. 



While these tricks help a little, we still do not know how to remove the {kiq){k2q) term in Eq.( 1.104 1. The method of 
choice is a Passarino-Veltman reduction which turns tensor integrals into scalar integrals, where a scalar integral does 
not have powers of the loop momentum in the numerator For example, the scalar three-point function is given by 
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C{klklml;mt,mumt) = / r„2 ^21 ^ ^, _ ^21 ^ ^ i..\2 _ ^21 ■ (1-106) 



Combining our Dirac trace in Eq.( |1.104l ) with the tensor integral in Eq.( |1.105] l and the applying a reduction algorithm 
gives us 

^p;f^ = ^[2+(4m?-my C(0,0,m?,;m,,m,,mO] • d-lO?) 

ITT^ [...\[...\[...\ V2 ' ' 

The first term not proportional to any scalar integral has a curious origin. It comes from a combination of 0{e) terms 
from the Dirac trace in n = 4 — 2e dimensions and a two-point function which for the integration measure 1 /itt^ 
always includes the ultraviolet divergence 1 /e. 

Scalar integrals we can for example calculate using the Feynman parameterization 

——^ 3-=/ dxi---dxn5(^x,~l\-, (1.108) 
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but we usually obtain shorter analytical expressions using the Cuskosky cutting rule which links the imaginary part of 
a diagram or a scalar integral to the sum of all cut Feynman graphs. 



The cut rule is directly related to the unitarity of the S matrix and the optical theorem discussed in Section 1.2.2 
Limiting ourselves to scalar amplitudes/integrals, the cut rule tells us that the sum of all cut one-loop or squared scalar 
diagrams has to vanish, including the two external cuts which correspond to the untouched amplitude A and its 
complex conjugate A* . This gives us a very useful expression for the imaginary part of the amplitude 
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The factor IGvr^ corresponds to the integral measure in the scalar integrals which can be arranged such that d'^q/ (itt^) 
gives a typical prefactor of one. Cutting diagrams means replacing all internal propagators by 
1/(9' ^ "^') ^ 27r 0{qo) 5{q^ — m?). From this imaginary part we compute the complete amplitude or scalar 
integral. If we know the pole or cut structure of the ampUtude, we can make use of the Cauchy integral for a complex 
analytic function A[z) 
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counter-clockwise 



and compute the unknown real part of A{q^) integrating such that the kinematic cut for real values to the right of 
q^ = mf + nn\ is outside the integration path 
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(mj+m|) 



This step assumes a sufficiently fast convergence on the integration contour for large momenta. This method of 

computing for example scalar integrals is known to produce the most compact results. 

The finite scalar three point function which appears in our effective coupling Eq.( 1.107] i has the form 
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For general top and Higgs masses it reads 
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including imaginary or absorptive terms for r < 1. The dimensionless variable r is the appropriate parameter to 
describe the behavior of this scalar integral. For example the low-energy limit of the scalar integral, i.e. the limit in 
which the top loop becomes heavy and cannot be resolved by the external energy of the order of the Higgs mass, will 
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be given by r > 1 which means niH < 2mt- In contrast to what many people who use such effective vertices assume, 
the expression in Eq.(jTTTT3jl is vaHd for arbitrary Higgs and top masses, not just in the heavy top limit. 
Expressing our Dirac trace and loop integral in terms of this function /(r) we find for our effective coupling in 
Eq.i 
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Using this result we can as the third and last step of our calculation collect all factors from our Feynman diagram and 
compute the effective ggH coupling 
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where the numerical factors originate from the closed fermion loop, the three top propagators, the two top-gluon 
couplings, the top Yukawa coupling, the color trace, the unmatched loop integration measure, and finally the result 



computed in Eq.( 1.1 14 1. 



Based on Eq.( 1. 101 1 which including prefactors reads 



(1.116) 



we can include the complete form factor in the effective Lagrangian. This finally defines the Feynman rule we are 
interested in 



-Sfggff D - gggH H G^" G 



With i5^^^ = -^^ir[l + (l-r)/(r)] 



(1.117) 



dropping the (5"''. It is important to notice that the necessary factor in front of the dimension-5 operator is 1/w and not 
l/nit- This is a particular feature of this coupling, which does not decouple for heavy top quarks because we have 
included the top Yukawa coupling in the numerator. Without this Yukawa coupling, the heavy top limit r ^ cx) of the 
expression would be zero, as we will see in a minute. 

Of course, just like we have three-gluon and four-gluon couplings in QCD we can compute the gggH and the ggggH 
couplings from the ggH coupling simply using gauge invariance. This set of n-gluon couplings to the Higgs boson is 
again not an approximate result in the top mass. Gauge invariance completely fixes the n-gluon coupling to the Higgs 
via one exact dimension-5 operator in the Lagrangian. These additional gluon field arise from the commutator of two 
gluon field in the field strength tensor, so they only exist in non-abelian QCD and cannot be generalized to the 
photon-photon-Higgs coupling. 
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1.4.2 Low-energy theorem 

The general expression for gggH is not particularly handy, but for light Higgs bosons we can write it in a more 
compact form. We start with a Taylor series for /(r) in the heavy-top limit t ^ 1 
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and combine it with all other r-dependent terms from Eq.( |l.l 17 1 

T [l + (l-r)/(r)] =T 
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In this low-energy or heavy-top limit where we have decoupled the top quark from the set of propagating Standard 
Model particles. The ggH coupling does not depend on rrit anymore and gives a finite result. Computing this finite 



result in Eq.( 1.1 15 1 we included the top Yukawa coupling in the numerator We emphasize again that while this 



low-energy approximation is very compact to analytically write down the effective ggH coupling, it is not necessary 
to numerically compute processes involving the effective ggH coupling. 

In this low-energy limit we can easily add more Higgs bosons to the loop. Attaching an external Higgs leg to the 
gluon self energy diagram simply means replacing one of the two top propagators with two top propagators and 
adding a Yukawa coupling 

1 1 ^/2TOt 1 
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This treatment including a gamma matrix in ^ 7^1?^ is, strictly speaking, nonsense. However, it gives us an idea 
how we can in the limit of a heavy top derive the ggH^ couplings from the gluon self energy or the ggH coupling 
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The first two effective couplings with one or two external Higgs legs are the triangular form factor 
gggH = —ids/ (127r) and the box form factor gggHH — (127r). To obtain the correct mass dimension each 

external Higgs field appears as H/v. Using log(l + x) = — X]n=i(~^)"/"- we can eventually re-sum this series of 
effective couplings in the form 
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In Section 1.2.2 we note that there should be a relative factor between the Lagrangian and the Feynman rule 
accounting for more than one way to identify the external legs of the Feynman rule with the fields in the Lagrangian. 
For n neutral Higgs fields the effective coupling has to include a factor of 1/n which is precisely the denominator of 
the logarithm's Taylor series. 
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Such a closed form of the Lagrangian is very convenient for simple calculations and gives surprisingly exact results 
for the gg H production rate at the LHC, as long as the Higgs mass does not exceed roughly twice the top mass. 
However, for example for gg — > H+jets production its results only hold in the limit that aJJ jet momenta are much 
smaller than m^. It also becomes problematic for example in the gg — > HH process close to threshold, where the 
momenta of slow-moving Higgs bosons lead to an additional scale in the process. We will come back to this process 
later. 



1.4.3 Signatures 

To discuss the Higgs production for example in gluon fusion we would normally need to know how to deal with 
gluons inside the incoming protons, how to parameterize the phase space of the Higgs decay products, and how to 



kinematically distinguish interesting events from others. All of this we will piece by piece introduce in Section 2.1 In 
contrast, throughout the following sections on Higgs production at the LHC we will limit ourselves to some very basic 
phenomenological features and postpone any discussion on how to compute these features. 

Without knowing any theoretical particle physics we can discuss the main feature or problem of hadron collider 
physics: there is no such thing as a signal without a background. More precisely, there is no kinematic configuration 
which is unique to a signal event and cannot appear as an unlucky combination of uninteresting Standard Model or 
QCD processes and detector effects. This implies that any LHC measurement will always be a statistics exercise in 
some kind of event counting combined with a probability estimate on the signal nature of a given event. 

The first quantity we can compute and analyze at colliders is the total number of events expected from a certain 
production process in a given time interval. This number of events is the product of the proton-proton LHC 
luminosity measured in inverse femtobarns, the total production cross section measured in femtobams, and the 
detection efficiency measured in per-cent. In other words, a predicted event event rate it is split into a collider-specific 
number describing the initial state, a process-specific number describing the physical process, and a detector-specific 
efficiency for each particle in the final state. 

The latter is the easiest number to deal with: over the sensitive region of the detector, the fiducial volume, the 
detection efficiency is a set of numbers depending on the nature of the detected particle and its energy. This number is 
very good for muons, somewhere between 90% and 100%, and less than 1/3 for tau leptons. Other particles typically 
range somewhere in between. 

For theorists luminosity is simply a conversion number between cross sections which we compute for a living and 
event numbers. People who build colliders use units involving seconds and square meters, but for us inverse 
femtobarns work better Typical numbers are: a typical year of LHC running at design luminosity could deliver up to 
10 inverse femtobams per year in the first few years and three to ten times that later At the end of 201 1 one inverse 
femtobarn is a realistic estimate. The key numbers and their orders of magnitude for typical signals are 

A^events = O-tot • ^ = 10 • • • 300 fb" ^ CTtot = 1 ' ' ' 10^ fb . (1.124) 

Different cross sections for Tevatron and LHC processes are shown in Figure |5] 

Finally, talking about cross sections and how to compute them we need to remember that at the LHC there exist two 
kinds of processes. The first involves all particles which we know and love, like old-fashioned electrons or slightly 
more modem W and Z bosons or most recently top quarks. All of these processes we call backgrounds. They are 
described by QCD, which means QCD is the theory of the evil. Top quarks have an interesting history, because when I 
was a graduate student they still belonged to the second class of processes, the signals. These either involve particles 
we have not seen before or particles we want to know something about. Such new particles are usually produced via 
QCD effects as well, so QCD as we will discuss it in Section|2]is not entirely evil. If we actually see new particles 
someone gets a call from Stockholm, while for the rest of the community the corresponding processes instantly turn 
into backgrounds. 

Because signals are things we had not seen before they are rare compared to much more rare that backgrounds. 
Digging out signal events from a large number of background events is what the remaining Higgs section will be 
about. Figure [s] shows that at the LHC the production cross section for a pair of bottom quarks is larger than 10^ nb or 
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Figure 5; Production rates for signal and background processes at hadron colliders. The discontinuity is due to the 
Tevatron being a proton-antiproton collider while the LHC is a proton-proton collider The two colliders correspond 
to the X-axis values of 2 TeV and something between 7 TeV and 14 TeV. Figure from Ref. 171 . 

10^^ fb, the typical production rate for W or Z bosons ranges around 200 nb or 2 x 10^ fb, the rate for a pair of 
500 GeV supersymmetric gluinos is 4 x 10^ fb, and the Higgs rate can be as big as 2 x 10^ fb. To extract such signals 
from huge backgrounds we need to describe the backgrounds with an incredible precision, at least those background 
events which populate the signal region in phase space. The high energy community has agreed that we call a five 
sigma excess over the known backgrounds a signal discovery 

g 

—= = Ncr > 5 (Gaussian limit) 

V B 

^fluct <5.8xl0"'^ (fluctuation probabiUty) . (1.125) 

Never trust anybody who wants to sell you a three sigma evidence as a discovery, because everyone who has been 
around for a few years has seen a great number of those go away. People usually have reasons to advertize such 
effects, like a need for help by the community, a promotion, or a wish to get the Stockholm call, but all they are really 
saying is that their errors do not allow them to make a conclusive statement. On the other hand, in the Gaussian limit 
the statistical significance improves with \/C. So all we need to do is wait for the significance to hit five sigma. 

One last aspect we have to at least mention is the trigger. Because of the sheer mass of data at the LHC, we will not be 
able to write every LHC event on tape. As a matter of fact, we could not even write every top pair event on tape. 
Instead, we have to decide very fast if an event has the potential of being interesting in the light of the physics 
questions we are asking at the LHC. Only these events we keep. Before there is a mis-understanding: while 
experimentalists are reluctant to change triggers these are not carved in stone, so as a functioning high energy physics 
community we will not miss great new physics just because we forgot to include it in the trigger menu. For now we 
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Figure 6: Simulated statistical significance for different Higgs production and decay channels for an integrated lumi- 
nosity of 30 fb^^ (left: ATLAS; right: CMS). Five standard deviations over the backgrounds are required for discovery. 



can safely assume that above an energy threshold we will keep all event with leptons or photons, plus as much as we 
can events with missing energy (i.e. neutrinos in the Standard Model and dark matter particles in new-physics 
models) and jets with high energy coming from resonance decays. This trigger menu reflects the general attitude that 
the LHC is not built to study QCD, and that very soft final states for example from bottom decays are best studied in 
the LHCb experiment instead of ATLAS and CMS. 

With this minimal background of collider phenomenology we can look at Higgs production in gluon fusion, combined 
with different Higgs decays. The total Higgs production cross section through the loop induced ggH coupling we 
show in Figure]?] For a reasonably light Higgs boson the cross section ranges around at least 30 pb, which for relevant 
luminosities starting around 30 fb^^ means 10^ events. The question is: after multiplying with the relevant branching 
ratio and detection efficiencies, which of the decays can be distinguished from the Standard Model background 
statistically? Since gluon fusion really only produces the Higgs boson the details of the production process do not help 
much with the background suppression. The results of experimental simulations, for example by the ATLAS 
collaboration, are shown in Figure]6] The complete list of possible Higgs decays, ordered by decreasing branching 
ratio according to Figure]3] is: 

gg ^ H ^ bb is hopeless, because of sheer size of the QCD continuum background gg bb, which according 
to Figure ]5] exceeds the signal by roughly eight orders of magnitude. There is little to cut on except for the 
invariant mass of the bb pair with an 0(10%) mass resolution. What makes things worse, this channel will as it 
stands not be triggered on. 

gg ^ H ^ T^T^ is problematic, because of the low velocity of the Higgs which makes the reconstruction of 



■niT-T ^ TTiH hard. We will discuss this decay and its reconstruction in detail in Section L5.3 It is widely 
assumed that Higgs production in gluon fusion is too close to threshold to see the decay to tau leptons, but this 
might still change. 

gg ^ H ^ is, in spite of the small rate, very promising. Because m^^ can be reconstructed to 0(1%) this 
observable has incredibly precise side bins to the left and the to right of the Higgs peak. This is what for 
example the electromagnetic calorimeter of CMS has been designed for The main problem is backgrounds for 
example from pions mistaken for photons, while theory input will play no role in this analysis. Moreover, the 
peak in the invariant mass of the photons is great to measure the Higgs mass: once we see a Gaussian peak we 
can determine its central value with a precision of Tdetector/A/S' (in a signal dominated sample with S signal 
events), i.e. at the per-mille level. 
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gg —i' H —i' W^W~ has a large rate, but once one of the W bosons decays leptonically the Higgs mass is hard 
to reconstruct. All we can do is reconstruct a transverse mass variable, which we discuss in Section [J!4l On the 
other hand, the backgrounds are electroweak and therefore small. The most promising analysis relies on angular 
correlations — if the two gauge bosons come from a spin-zero resonance they have to have opposite 
polarization; because the W coupling to fermions is purely left handed this implies that the two leptons prefer to 
move into the same direction as opposed to back-to-back. Because there is not much more to cut on the 
expected significance drops sharply with the decreasing branching ratio once we require off-shell W decays. 

gg ^ H ^ ZZ works great for ZZ to four leptons, in particular muons, because of the fully reconstructed 
mu ~ mzz ^ TTiH- Of all Higgs channels it requires the least understanding of the LHC detectors. Therefore 
it is referred to as 'golden channel'. Its limitation are the leptonic Z branching ratio and the the sharp drop in 
the off-shell Higgs branching ratio towards smaller Higgs masses. The semi-leptonic decay of the Z pair might 



work once we ask for a boosted hadronically decaying Z, as we will discuss in Section 3.2.2 



gg H Zj is a little like 77, but with a smaller rate and a further reduced branching ratio of Z — > /i^/i^ 
or Z — > e+e^. Instead of combining the advantages of H ^ ZZ and H -fy this channel combines more of 
the disadvantages, so it is not likely to be helpful. Of course, as for any channel seeing it will give us more 
information on the Higgs boson, so we should not give up. 

gg ^ H ^ invisible is not predicted in Standard Model; it is obviously hopeless if only the Higgs is produced, 
because we would be trying to extract a signal of missing energy and nothing else. 'Absolutely nothing' in 
addition to some QCD remnant is not a good signature for the trigger. 



1.5 Higgs production in weak boson fusion 

Going back to Figure|4]we see that while gluon fusion gives the largest Higgs production rate at the LHC, there are 
other promising channels to study. In the Standard Model the Higgs has sizeable couplings only to the W and Z 
bosons and to the top quark, so instead of via the top Yukawa coupling we can produce Higgs bosons via their gauge 
boson couplings. This induces two channels, the larger of which is weak boson fusion qq — > qqH: two incoming 
quarks each radiate aW 01 Z boson which merge and form a Higgs. Because the LHC is a pp collider and because the 
proton mostly contains the valence quarks (uud) and low-a; gluons it is important that this process can proceed as 
ud — > duH, where the u radiates a and the d radiates a . The Feynman diagram for this process is 



If the Higgs were a Z boson, it could also bremsstrahlung off the incoming or outgoing quarks, but for Higgs 
production at colliders we safely assume that the first two generation fermions are massless. That is at least unless we 
discuss a muon collider as a specific way to produce Higgs bosons. 

In a way, weak boson fusion looks like double deep inelastic scattering, one from each of the protons. This is one of 
the key observations which in Section [T. 5. 2| we will use for background suppression via the central jet veto. The 
double deep inelastic scattering approximation is also a good way to compute corrections to the weak boson fusion 
production rate, at least provided we neglect kinematic distributions. 



1.5.1 Production kinematics 

In the Feynman diagrams for weak-boson fusion Higgs production we encounter intermediate massive gauge boson 
propagators. They induce a particular shape of the kinematic distributions of the final state jet. First, we need to quote 
the exact calculation showing that in the matrix element squared we will usually find one power of pT in the 
numerator. With this information we can look for the maximum in the prj = Pt,w spectrum as a function of the 
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momentum in the beam direction, p^, and the absolute value of the two-dimensional transverse momentum px 
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for some number C < 1. This admittedly hand- waving argument shows that in weak-boson-fusion Higgs production 
the transverse momenta of the outgoing jets peak at values below the W mass. In reality, the peak occurs around 
Pt ~ 30 GeV. This transverse momentum scale we need to compare to the longitudinal momentum given by the 
energy scale of valence quarks at the LHC, i.e. several hundreds of GeV. 

These two forward jets are referred to as tagging jets. They offer a very efficient cut against QCD backgrounds: 
because of their back-to-back geometry and their very large longitudinal momentum, their invariant mass rrijj will 
easily exceed a TeV. For any kind of QCD background this will not be the case. Compared to Higgs production in 
gluon fusion the tagging jets are an example how features of the production process which have little or nothing to do 
with the actual Higgs kinematics can help reduce backgrounds — the largest production rate does not automatically 
yield the best discovery prospects. 

Moving on to the Higgs kinematics, in contrast to the jets the Higgs and its decay products are expected to hit the 
detector centrally. So we are looking for two forward jets and for example two r leptons or two W bosons in the 
central detector. Last but not least, the Higgs is produced with finite transverse momentum which is largely 
determined by the kinematic and the acceptance cuts on the forward jets and their typical transverse momentum scale 

Pth ~ '^w- 



Compared to the Higgs production in gluon fusion we buy this distinctive signature and its efficient extraction from 
the background at the expense of the rate. Let us start with the partonic cross sections: the one-loop amplitude for 
gg H is suppressed by asj/t/(47r) ^ (1/10) (2/3) (1/12) = 1/180. For the production cross section this means a 
factor of (1/180)2 ^ 1 /400G0. The cross section for weak boson fusion is proportional to g^, but with two additional 
jets in the final state. Including the additional phase-space for two jets this roughly translates into 
(2/3)^ l/(167r)2 = (64/729) (1/2500) - 1/25000. These two numbers governing the main LHC production cross 
sections roughly balance each other. 

The difference in rate which we see in Figure |4] instead arises from the quark and gluon luminosities. In weak boson 
fusion the two forward jets always combine to a large partonic center-of-mass energy 

X1X2S > {pj^i + Pj,2)^ = '2{Pj,iPj,2), with the two parton momentum fractions xi,2 and the hadronic center of mass 
energy ^/s = 14 TeV. Producing a single Higgs in gluon fusion probes the large gluon parton density at typical parton 
momentum fractions x ^ ran / y/s ^ 10^^. This means that each of the two production processes with their specific 
incoming partons probes its most favorable parton momentum fraction: low-a; for gluon fusion and high-x for valence 
quark scattering. Looking at typical LHC energies, the gluon parton density grows very steeply for x < 10^^ xhis 
means that gluon fusion wins: for a 150 GeV Higgs the gluon fusion rate of ^ 30 pb clearly exceeds the weak boson 
fusion rate of ^ 4 pb. On the other hand, these numbers mean little when we battle an 800 pb background relying 
on kinematic cuts either on forward jets or on Higgs decay products. 

In Figure|6]we see that for large Higgs mass the weak boson fusion rate approaches the gluon fusion rate. The two 
reasons for this behavior we mentioned already in this section: first of all, for larger x values the rate for gg H 
decreases steeply with the gluon density, while in weak boson fusion the already huge partonic center of mass energy 
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due to the tagging jets ensures that an increase in mn makes no difference anymore. Even more importantly, there 
appear large logarithms because the low-px enhancement of the quark- splitting. If we neglect mw in the weak 



boson fusion process the pxj distributions will diverge for small p^.j like 1 /pr.j, as we will see in Section 2.3.3 After 
integrating over px.j this yields a logp™|/|5™" dependence of the total rate. With the W mass cutoff and a typical 
hard scale given by itih this logarithm becomes 

/ \ 2 o 



awBFCx log^ ^ log^ . (1.127) 



For niH = ©(TeV) this logarithm gives us an enhancement by factors of up to 10, which makes weak boson fusion 
the dominant Higgs production process. 

Motivated by such logarithms, we will talk about partons inside the proton and their probabiUty distributions for given 
momenta in SectionP 



2.3.3 



In the effective W approximation we can resum the logarithms appearing in Eq.( 1. 127 1 or 



compute such a probability for W bosons inside the proton. This number is a function of the partonic momentum 
fraction x and can be evaluated as a function of the transverse momentum pT- Because the incoming quark inside the 
proton has negligible transverse momentum, the transverse momenta of the W boson and the forward jet are identical. 
These transverse momentum distributions in pr.w — Pt.j look different for transverse and longitudinal gauge bosons 

i'T(-i,PT>-- 4^2 2x + (1 - x) m2,)2 4^2 2x pI 

The couplings gA.v describe the gauge coupling of the W bosons to the incoming quarks. Looking at large transverse 
momenta pT ^ mw the radiation of longitudinal W bosons falls off sharper than the radiation of transverse W 
bosons. This different behavior of transverse and longitudinal W bosons is interesting, because it allows us to gain 
information on the centrally produced particle and which modes it couples to just from the transverse momentum 
spectrum of the forward jets and without looking at the actual central particle. 

The problem with the effective W approximation is that it is a poor description of Higgs production at the LHC. The 
simple reason is that the Higgs mass is of the order of the W mass, as are the transverse momenta of the W and the 
final state jets, and none of them are very small. Neglecting for example the transverse momentum of the W bosons or 
the final-state jets will not give us useful predictions for the kinematic distributions, neither for the tagging jets nor for 
the Higgs. For the SSC, the competing design to the LHC in Texas which unfortunately was never built, this might 
have been a different story, but at the LHC we should not describe W bosons (or for that matter top quarks) as 
essentially massless partons inside the proton. 



1.5.2 Jet ratios and central jet veto 

From the Feynman diagram for weak boson fusion we see that the diagram describing a gluon exchange between the 
two quark lines multiplied with the Born diagram is proportional to the color factor Tr T"^ Tr T^S"^^ = 0. The only 
way to avoid this suppression is the interference of two identical final-state quarks, for example in ZZ fusion. First, 
this does not involve only valence quarks and second, this assumes a phase space configuration where one of the two 
supposedly forward jets turns around and goes backwards, so the interfering diagrams contribute in the same phase 
spare region. This means that virtual gluon exchange in weak boson fusion is practically absent. 
In Section|2]we will see that virtual gluon exchange and real gluon emission are very closely related. Radiating a 
gluon off any of the quarks in the weak boson fusion process will lead to a double infrared divergence, one because 
the gluon can be radiated at small angles and one because the gluon can be radiated with vanishing energy. The 
divergence at small angles is removed by redefining the quark parton densities in the proton. The soft divergence has 
to cancel between real gluon emission and virtual gluon exchange. However, if virtual gluon exchange does not 
appear, non-coUinear gluon radiation cannot appear either This means that additional QCD jet activity as part of the 
weak boson fusion process is limited to collinear radiation, i.e. radiation along the beam line or at least in the same 
direction as the far forward tagging jets. Gluon radiation into the central detector is suppressed by the color structure 
of the weak boson fusion process. 
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While it is not immediately clear how to quantify such a statement it is a very useful feature, for example looking at 
the top pair backgrounds. The WWbb final state as a background to qqH, H — WW searches includes two bottom 
jets which can mimic the signal's tagging jets. At the end, it turns out that it is much more likely that we will produce 
another jet — > tf+jet so only one of the two bottom jets from the top decays needs to be forward. In any case, the 
way to isolate the Higgs signal is to look at additional central jets. 

As described above, for the signal additional jet activity is limited to small-angle radiation off the initial-state and 
final-state quarks. For background like top pairs this is not the case, which means we can reduce all kinds of 
background by vetoing jets in the central region above pT,j ^ 30 GeV. This strategy is referred to as central jet veto or 
mini-jet veto. Note that it has nothing to do with rapidity gaps at HERA or pomeron exchange, it is a QCD feature 
completely accounted for by standard perturbative QCD. 

From QCD we then need to compute the probability of not observing additional central jets for different signal and 



background processes. Postponing the discussion of QCD parton splitting to Section 2.3.2 we already know that for 
small transverse momenta the pr.j spectra for massless states will diverge, as shown in Eq.( 1.127 i. Looking at some 
kind of n-particle final state and an additional jet radiation we can implicitly define a reference point p"" at which the 
diverging rate for one jet radiation (T„+i starts to exceed the original rate cr„, whatever the relevant process might be 



da- 



[fi-) = I dpT., -^P^ = CJr. . (1.129) 

This condition defines a point in px below which our perturbation theory in as, i.e. in counting the number of 
external partons, breaks down. For weak boson fusion Higgs production we find p™' ^ 10 GeV, while for QCD 
processes like production it becomes p™' — 40 GeV. In other words, jets down to px =10 GeV are perturbatively 
well defined for Higgs signatures, while for the QCD backgrounds jets below 40 GeV are much more frequent than 
they should be looking at the perturbative series in ag. This fixes the px range where a central jet veto will be helpful 
to reject backgrounds 

PT.j > 30 GeV and rjf^ < rij < r^f^^^ . (1.130) 

The second condition reminds us of the fact that only central jets will be rare in weak boson fusion. The smaller the 
Pt threshold the more efficient the central jet veto becomes, but at some point experimental problems as well as 
non-perturbative QCD effects will force us to stay above 20 or 30 or even 40 GeV. 

If we assign a probability pattern to the radiation of jets from the core process we can compute the survival probability 
of such a jet veto. If we assume that we can describe successive jet radiation using Poisson statistics the probability of 
observing exactly n jets given an expected tiq jets is 



(1.131) 



/(n;no) = ^i^ ^ /(0;no) 



This defines the so-called exponentiation model. We consistently fix the expectation value in terms of the inclusive 
cross sections producing at least zero or at least one jet, 

no = ' . (1.132) 

This ensures that the inclusive jet ratio cti / ctq is reproduced by the ratio of the corresponding Poisson distributions. In 



Section 2.3.4 we will see why such inclusive cross sections for the production of a certain final state together with an 



unspecified number of jets is an appropriate observable. Only in Section 2.5.3 we will actually be able to formulate 
observables for final states with a fixed number of jets. Including this expectation value no into Eq.( |1.131~| returns a 
veto survival probability of exp(— (Ti/cto)- This comes out roughly as 88% for the weak boson fusion signal and as 
24% for the ti background. For the signal-to-background ratio this is an increase by a factor three. 

An alternative model starts from a constant probability of radiating a jet, which in terms of the inclusive cross sections 
(Tj, i.e. the production rate for the radiation of at least j jets, reads 

^3 + i{pT) ^p^^^(pm.n)^ (1^133) 



min^ 
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The expected number of jets is then given by 



which, assuming the series converges, turns into a requirement on p™". Radiating jets with a constant probabihty has 
been observed at many experiments, including most recently the LHC, and is in the context of T4^+jets referred to as 



staircase scaling. Some advanced simulational results we will discuss in Section 2.5.3 Even without saying anything 
on how to calculate exclusive processes with a fixed number of jets we can derive a particular property of the constant 
probability of staircase scaling: the ratios of the {n + l)-jet rate to the n-jet rate for inclusive and exclusive jet rates 
are identical. We can see this by computing the inclusive P^-^^ in terms of exclusive jet rates 



Eoo (excl) 



(excl) Y^oo (excl) 
"n ~r l^j=n+l"j 



(excl) ^oo pj ^(excl) 

/ — 7^ ■ — with -Ppxrl — — 7 .\ 

(excl) _|_ (excl) ^^00 pj '^'■^^ (excl) 

Z^j=0-'excl 

(excl) 



-^exclC^n 

_ 1 ~ ^excl _ -^excl 

~ , n P ,rr^'""''^ ~ 1 - ^'excl + f'excl 
^(excl) _|_ ^exclO^n 

1 -^excl 

= Pexcl. (1.135) 

Using this result we can check the validity of the Poisson ansatz from Eq.( |1.13l"] l for staircase scaling 

Po.so„ n^a + r)l ^ n + 1 ^^^^ ^^^^ 

^^exci) n!nJJ+' no P,-ad 

so the exponentiation model does not correspond to a constant probability of radiating jets, i.e. to staircase scaling. 

To show that the exponentiation model and staircase scaling are not the only assumptions we can make to compute jet 
rates we show yet another, but similar ansatz which tries to account for an increasing number of legs to radiate jets off. 
Based on 



PUpD , (1-137) 



the expectation for the number of jets radiated gives, again following Eq.( 1.134 1 



All of these models are more or less well motivated statistical approximations. The do not incorporate experimental 
effects or the non-perturbative underlying event, i.e. additional energy dependent but process independent jet activity 
in the detectors from many not entirely understood sources. For many reasons none of them is guaranteed to give us 



the correct numbers. By the time we get to Section 2.5.3 we will be able to more accurately describe the central jet 
veto in QCD. 



1.5 Higgs production in weak boson fusion 
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1.5.3 Decay kinematics and signatures 

The sizeable transverse momentum of the Higgs in the weak boson fusion process allows us to reconstruct the 
invariant mass of a rr system in the collinear approximation: if we assume that a r with momentum p decays into a 
lepton with the momentum xpand a neutrino, both moving into the same direction as the tau, we can write the 
two-dimensional transverse momentum of the two taus from the Higgs decay in two ways 

pi + P2 = — + — - fcl +k2 + i- (1.139) 

Xl X2 

The missing transverse momentum k is the measured vector sum of the two neutrino momenta. This equation is useful 
because we can measure the missing energy vector at the LHC in the transverse plane, i.e. as two components, which 



means Eq.( 1.139 1 is really two equations for the two unknowns xi and X2. Skipping the calculation of solving these 



two equations for xiX2 we quote the result for the invariant rr mass 

(1.140) 



Of ^ 9(^1^2) 
rrirT = 2 [piP2j = 2 



X1X2 



which for the signal has to correspond to the Higgs mass. From the formula above it is obvious that this 
approximation does not only require a sizeable ^ rrir, but also that back-to-back taus will not work — the two 
vectors contributing to k then largely cancel and the computation fails. This is what happens for the production 
channel gg ^ H ^ tt, where the Higgs boson is essentially produced at rest. 

Again, we can make a list of signatures which work more or less well in connection to weak boson fusion production. 
These channels are also included in the summary plot by ATLAS, shown in Figure|6] 

qqH, H ^ bbis problematic because of large QCD backgrounds and because of the trigger in ATLAS. The 
most worrisome background is overlapping events, one producing the two tagging jets and the other one two 
bottom jets. This overlapping scattering gives a non-trivial structure to the background events, so a brute force 
side-bin analysis will not work. 

qqH, H ^ TT can be a discovery channel for a light Higgs boson with mfj < 130 GeV. The approximate mass 
reconstruction might be as good as ^ 5 GeV. This channel is particularly useful in scenarios beyond the 
Standard Model, like its minimal supersymmetric extension. It guarantees the discovery of one Higgs boson 
over the entire supersymmetric parameter space without a dedicated SUSY search. 

qqH, H ^ -f-f should be compatible with gg H ^ with its smaller rate but improved background 
suppression. It is included in inclusive H ^ -fj analyses, and for neural net analyses zooming in on large 
Higgs transverse momenta it will dominate the inclusive analysis. 

qqH, H — > Z7 is difficult due to a too small event rate and no apparent experimental advantages. 

qqH, H — )• WW is a discovery channel for m^j > 135 GeV. In contrast to gg H ^ WW it also works for 
off-shell W decays, i.e. for Higgs masses below 150 GeV. This is where a multitude of background rejection 
cuts and a better signal-to-background ratio win over the large rate in gluon fusion. 

qqH, H ZZ is likely to work in spite of the smaller rate compared to gluon fusion. It might even be possible 
with one hadronic Z decay, but there are not many detailed studies available. 

qqH, H — > invisible is the only discovery channel for an invisible Higgs which really works at the LHC. It 
relies on the pure tagging-jet signature, which means it is seriously hard and requires a very good understanding 
of the detector and of QCD effects. 



Just a side remark for younger LHC physicists: weak boson fusion was essentially unknown as a production mode for 
light Higgses until around 1998 and started off as a very successful PhD project. This meant that for example the 
Higgs chapter in the ATLAS TDR had to be re-written. While it sometime might appear that way, there is no such 
thing as a completely understood field of LHC physics. Every aspect of LHC physics continuously moves around and 
your impact only depends on how good your ideas and their technical realizations are. 
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1.6 Associated Higgs production 

In Figure |4]there appears a third class of processes at smaller rates: associated Higgs production with heavy Standard 
Model particles, like W or Z bosons or tt pairs. Until the summer of 2008 the Higgs community at the LHC was 
convinced that (a) we would not be able to make use of WH and ZH production at the LHC and (b) we would not be 
able to see most of the light Higgs bosons produced at the LHC, because H ^ bb is no promising signature in gluon 
fusion or weak boson fusion production. 

One key to argument (a) are the two different initial states for signal and background: at the Tevatron the processes 
qq — T' ZH and q'q WH arise from valence quarks. At the LHC with its proton-proton beam this is not possible, so 
the signal rate will suffer when we go from Tevatron to LHC. The QCD background at leading order is 
qq Zg* — > Zbb production, with an off-shell gluon splitting into a low mass bottom quark pair At next-to-leading 
order, we also have to consider the ^-channel process qq — > Zbbg and its flipped counter part qg — > Zbbq. This 
background becomes more dangerous for larger initial state gluon densities. Moving from Tevatron to LHC the Higgs 
signal will decrease while the background increases — not a very promising starting point. 



With Ref. in the whole picture changed. We will discuss this search strategy in detail in Section 3.2.2 in the context 



of jets and jet algorithms at the LHC. It turns out that searches for boosted Higgs bosons are not only promising in the 
VH, H bb channel, but might also resurrect the ttH, H bb channel. These new channels are not yet included in 
the ATLAS list of processes shown in Figure|6]because the simulations are still at an early stage. But we can expect 
them to play a role in LHC searches for a light Higgs boson. 

This is another example of what is possible and what not in LHC phenomenology: it all really depends only on how 
creative you ai^e; that even applies to a field Hke Standard Model Higgs seaixhes, which is supposedly studied to death. 

1.7 Beyond Higgs discovery 

The prime goal of the LHC is to discover a new light scalar particle which we would suspect to be the Higgs boson. 
However, the Standard Model makes many predictions concerning the properties of a Higgs boson; as a matter of fact, 
all its properties except for its mass are fixed in the minimal one-doublet Higgs sector of the Standard Model. The 
question is, can we test at least some of these predictions? 

In this section we will briefly touch on two interesting questions relevant to LHC Higgs physics and to determining if 
what we might see at the LHC is really a Higgs boson. This is where we have seen the most progress in LHC Higgs 
physics over recent years: not only will we be able to see a light Higgs boson simultaneously in different production 
and decay channels, as discussed in Sections L4 and |L5[ we can also study many of its properties. In a way this 



section ties in with the effective theory picture we use to introduce the Higgs mechanism: the obvious requirements to 
include massive gauge bosons in an effective electroweak gauge theory leads us towards the Standard Model Higgs 
boson only step by step, and at any of these steps we could have stopped and postulated some alternative ultraviolet 
completion of our theory. 

1.7.1 CP properties 

One question for the LHC is: once we see a Higgs boson at the LHC, how do we test its quantum numbers? One such 
quantum number is its spin. A spin-one Higgs boson would not couple to two photons, so we only need to look at 
angular distributions for example in weak boson fusion to distinguish spin zero from spin two. 
Once we know that we are talking about a scalar field we still need to measure its CP properties. The fundamental 
CP-even Standard-Model Higgs bosons couples to the W and Z bosons proportional to g^'^. For general CP-even and 
CP-odd Higgs bosons there are two more gauge invariant ways to couple to W bosons, which altogether give us the 
three coupling tensor structures 



g'-" gt^'^-PlPl e^.p.KP2- (1-141) 

PlP2 



These are the only gauge invariant dimension-5 couplings of two gauge bosons to a (pseudo-) scalar field. The second 
one appears in the effective one-loop gluon-gluon-Higgs coupling in Eqs.l L117|L123 i. This second tensor is not 



orthogonal to g^", but we can replace it with any linear combination with g'^''. On the other hand, these two coupling 
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Figure 7: Distribution for the azimuthal angle between the two forward jets in WBF production with a subsequent 
decay H tt. The signal and the leading Z background are simulated at parton level. All signal rates are normalized 
to the Standard-Model value. Figure from Ref. [9J. 

structures nicely correspond to the gauge-invaiiant forms W^^^W^,, and W^^'^Wfj,^ in the electroweak Lagrangian, so 
we leave them like this. 



Going back to the transverse momentum spectra for the tagging jets shown in Eq.( 1 . 128 i we already know one way of 
telling apart these couplings: by construction, the dimension-3 WWH coupling proportional to g^^'^ comes out of the 
Higgs potential as a side product of the W mass term, i.e. it couples the longitudinal Goldstone modes in the W boson 
to the Higgs. In contrast, the CP-even dimension-6 operator is proportional to the transverse tensor, which means it 
couples the transverse W polarizations to the Higgs and therefore produces a harder pT spectrum of the tagging jets. 

A very useful observable is the azimuthal angle between the two tagging jets in weak boson fusion, i.e. the angle 



separating the two jets in the transverse plane. For the three gauge-invariant coupling structures shown in Eq.( 1.141 1 
we find three distinctly different normalized distributions of the azimuthal angle A(/>, shown in Figure]?] 
The tree-level coupling should in principle show no dependence of A0 at all. The decrease towards larger angular 
separation is an effect of kinematic cuts against the Z background which prefers large angular separation. Therefore, 
the signal is depleted in the background-rich region. The CP-odd and therefore antisymmetric e^^'^f"^ coupling 
vanishes once two momenta contracted by the Levi-Civita tensor are equal. This explains the zeros at = 0, tt, where 
the two transverse jet momenta are no more linearly independent. To explain the opposite shape of the CP-even Higgs 
we use the low transverse momentum limit of the tagging jets. In that case the Higgs production matrix element 
becomes proportional to the scalar product {p^ p^ ) which vanishes for a relative angle A(/) = 7r/2. 

A related method of telling apart the y'"' structure and the CP-odd coupling is looking at the H ZZ 

decay kinematics. Each of the decaying Z bosons defines a plane opened by the two lepton momenta. If the angle 

between these two planes in the Higgs rest frame is (p, its distribution can be written as 

^ cx 1 + acos((/)) + 6cos(20) . (1.142) 
d(p 

For the CP-odd Higgs coupling to W'^'^Wf^i, we find a = and b = 1/4, while for the CP-even Higgs coupling g^'' 
we find a > 1/4 depending on niH- This method only works if we observe the decay H ZZ — > 4£ with a good 
signal-to-background ratio S/B. Both methods of determining the Higgs coupling structure are equivalent, as we can 
see from their Feynman diagrams: starting from the Feynman diagram for H — > ZZ — > A£ we can switch two fermion 
legs from the final state into the initial state 
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and read the diagram right-to-left. The same information which we can find in the angular correlation of the tagging 
jets is encoded in the decay plane correlation. The advantage of the first is that a production-side correlation in weak 
boson fusion is independent of the Higgs decay and can be used for all Higgs masses and decay signatures. 

1.7.2 Higgs self coupling 

If we should ever observe a scalar particle at the LHC, the crucial question will be if this is indeed our Higgs boson 
arising from electroweak symmetry breaking. In other words, does this observed scalar field have a potential with a 
minimum at a non-vanishing vacuum expectation value? 

Of course we could argue that a dimension-3 Higgs coupling to massive W bosons really is a Higgs-Goldstone self 
coupling, so we see it by finding a Higgs in weak boson fusion. On the other hand, it would be far more convincing to 
actually measure the self coupling of the actual Higgs field. This trilinear coupling we can probe at the LHC studying 
Higgs pair production, for example in gluon fusion via the usual top quark loop 




Following exactly the same argument as presented in Section 1 .4. 1 we can derive the two tensor structures 
contributing to Higgs pair production for transverse gluons 



V2Pr = 9^" 



^3 ^1 '^■2 

kj,{kik2) 



2{k2k3)k'(k^ 2(fci/c3)fc^fc^ 



k^{kik2 
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with 



(fclfc3)(fc2fc3) 
(fclfc2) 



(1.143) 



The third momentum is one of the two Higgs momenta, so fc| — mj^. In this representation the two tensors are 
orthonormal, which means — 1 , P| = 1 and Pt ■ P2 = 0- The second tensor structure is missing in single Higgs 
production, so it only appears in the continuum (box) diagram and turns out to be numerically sub-leading over most 
of the relevant phase space. 



From Section 1 .4.2 on the effective Higgs-gluon coupling we know that in the low-energy limit we can compute the 
leading form factors associated with the triangle and box diagrams, both multiplying the transverse tensor 
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ki k2 / {kik2) for the incoming gluons 
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Close to the production threshold s ^ (2mH)^ the leading contribution to the loop-induced production cross section 
for gg -> HH involving the two Feynman diagrams above and the Higgs self coupling derived in Section [T.2.1| is then 
proportional to 
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(1.145) 



so the triangle diagram and the box diagram cancel. 

In this argument we assume that the Higgs self coupling in the Standard Model is proportional to tuh- To see 
deviations from this self coupling in the first term of Eq.( T7l45| l we can look at something like the mnH distribution 
and measure the threshold behavior. In the absence of any self coupling this threshold cancellation of the self coupling 
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Figure 8: The (parton-level) sensitivity limits for Higgs pair production and a measurement of the Higgs self coupling. 
The analysis is based on the decay HH WW. Figure from Ref. Ii 11 i . 



contribution with the continuum should be absent as well. The threshold contribution to Higgs pair production would 
be huge. This way a shape analysis of the threshold behavior will allow us to experimentally exclude the case of 
^HHH = which predicts an unobserved large enhancement of the production cross section at threshold. At this 
point it is still under study if such a measurement will work at the LHC and what kind of LHC luminosities would be 
required. 

Of course, the Higgs self coupling sticks out of the Higgs parameters, but all Higgs couplings would be interesting to 
measure. If the Higgs is light and if all the of gluon fusion and weak boson fusion channels are indeed observed with 
sufficient luminosity, a numerical fit of all couplings to all Higgs observables will be possible, for example using the 
SFitter reconstruction tool 1 10|. As you can see in the original paper, for integrated luminosities around 30 fb^^ the 
accuracy on the extracted couplings ranges from 30% to 70% error bands. Contrasting more specific hypotheses, like 
a strongly interacting light Higgs, with the Standard Model reduces these errors bars, possibly to 0(10%). 

1.8 Further reading 

At this point we are done with our brief review of the Higgs sector in the Standard Model and of contemporary Higgs 
phenomenology. From the discussions it should be clear that we have to move on to QCD, to understand the strengths 
and weaknesses of these searches and what distinguishes a good from a bad search channel. 
Before moving on we mention a few papers where you can learn more about Higgs phenomenology at the LHC. 
Luckily, for Higgs searches there are several very complete and useful reviews available: 

- You can look at the original articles by Higgs lfT2l[T3l or Brout and Englert lfT4ll . but they are rather short and 
not really on top of the phenomenological aspects of the topic. Other papers for example by Guralnik, Hagen, 
Kibble [15] tend to be harder to read for phenomenologically interested students. 

- Wolfgang Kilian's book fTSl on the effective field theory approach to the Higgs mechanism is what the 
corresponding sections in these lecture notes are largely based on it. The underlying symmetry structure, 
including the custodial symmetry is nicely described in Scott Willenbrock's TASI lectures ifTTl . 

- Abdelhak Djouadi's compilation on 'absolutely everything we know about Higgs phenomenology' is indeed 
exhaustive. It has two volumes, one on the Standard Model IS) and another one on its minimal supersymmetric 
extension @ 

- For more experimental details you might want to have a look at Karl Jakobs' and Volker Biischer's review of 
LHC Higgs physics lUSJ. 
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As a theory view on LHC Higgs physics, mostly focused on gluon fusion production and its QCD aspects, there 
is Michael Spira's classic [4 |. This is where you can find more information on the low -energy theorem. Michael 
and his collaborators also published a set of lecture notes on Higgs physics |19|. Watch out for how Michael 
summarizes the state of the art concerning weak boson fusion production of light Higgs bosons. 

As always, there is a TASI lecture on the topic. TASI lecture notes are generally what you should look for when 
you are interested in an area of high energy physics. Dave Rainwater did not only write his thesis on Higgs 
searches in weak boson fusion |20 |, he also left us all he knows about Higgs phenomenology at the LHC in his 
TASI notes Ell . 

Tao Han wrote a very comprehensible set of TASI lecture notes on basic LHC phenomenology, in case you need 
to catch up on this [22J. 

For some information on electroweak precision data and the p parameter, there are James Wells' TASI 
lectures |23|. 

If you are interested in Higgs production in association with aW oi Z boson and the way to observe boosted 
H ^ bb decays you need to read the original paper |8 1. The same is true for the ttH analysis. 

For cut rules and scalar integrals the best writeup I know is Wim Beenakker's PhD thesis. Unfortunately, I am 
not sure where you would get it from, so you have to drop Wim an email. 
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2 QCD 

Just as Section[T]is not meant to be a complete introduction into electroweak symmetry breaking but is aimed at 
introducing the aspects of Higgs physics most relevant to the LHC this section cannot cover the entire field of QCD. 
Instead, we will focus on QCD as it impacts LHC physics and searches for new physics at the LHC, like for example 
the Higgs searches discussed in the first part of the lecture. 



In Section 2. 1 we will introduce the most important process at the LHC, the Drell-Yan process or lepton pair 
production. This process will lead us through all of the introduction into QCD. Ultraviolet divergences and 
renormalization we will only mention in passing, to get an idea how much of the treatment of ultraviolet and infrared 



divergences works the same way. After discussing in detail infrared divergences in Sections 2.3 to 2.5 we will spend 
some time on modern approaches on combining QCD matrix element calculations at leading order and next-to-leading 
order in perturbative QCD with parton showers. This last part is fairly close to current research with the technical 
details changing rapidly. Therefore, we will rely on toy models to illustrate the different approaches. 

2.1 Drell-Yan process 

Most text books on QCD start from a very simple class of QCD processes, namely deep inelastic scattering. These are 
processes with the HERA initial state e^p. The problem with this approach is that for the LHC era such processes are 
not very useful. What we would like to understand instead are processes of the kind pp — ^ M^4-jets, pp — > i/4-jets, 
pp — > tt+jets, or the production of new particles with or without jets. These kind of signal and background processes 



and their relevance in an LHC analysis we already mentioned in Section L4.3 



From a QCD perspective such processes are very complex, so we need to step back a little and start with a simple 
question: we know how to compute the production rate and distributions for photon and Z production for example at 
LEP, e~^e^ j,Z ^ t^i~ . What is the production rate for this final state at the LHC, how do we account for quarks 
inside the protons, and what are the best-suited kinematic variables to use at a hadron collider? 

2.1.1 Gauge boson production 

The simplest question we can ask at the LHC is: How do we compute the production of a weak gauge boson? This 
process is referred to as Drell-Yan production. In our first attempts we will explicitly not care about additional jets, so 
if we assume the proton to consist of quarks and gluons and simply compute the process qq j, Z under the 
assumption that the quarks are partons inside protons. Gluons do not couple to electroweak gauge bosons, so we only 
have to consider valence quark vs sea anti -quark scattering in the initial state. Modulo the SU{2) and U{1) charges 
which describe the Zff and 7// couplings in the Feynman rules 



-Z7^ {IFl + rVn) with I ^ (T3 - 2Qsl) r^e ^ ^ {Zff) 



e^r^Qe (7//), (2.1) 

with T3 — ±1/2, the matrix element and the squared matrix element for the partonic process 

qq^l,Z (2.2) 

will be the same as the corresponding matrix element squared for e+e" 7, Z, with an additional color factor. The 
general amplitude for massless fermions is 

M = -ivihh'' (^Pl + rPfl) u{ki)e^ . (2.3) 

Massless fermions are a good approximation at the LHC for all particles except for the top quark. For the bottom 
quark we need to be careful with some aspects of this approximation, but the first two generation quarks and all 
leptons are usually assumed to be massless in LHC simulations. Once we will arrive at infrared divergences in LHC 
cross sections we will specifically discuss ways of regulating them without introducing masses. 



Squaring the matrix element in Eq.(2.3 1 means adding the same structure once again, just walking through it in the 



opposite direction. Luckily, we do not have to care about factors of (— i) since we are only interested in the absolute 



50 



2 QCD 



value squared. Because the chiral projectors Pl.r = (1 T 75)/2 are real and = 75 is symmetric the left and right 
handed gauge boson vertices described by the Feynman rules in Eq.( |2.1| i do not change under transposition, so for the 
production of a massive Z boson we obtain 



spin,pol,color 
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with(gA:i) = —(^1^2) 
withq^ = {h + kaY 



(2.4) 



The momenta are h and fc2 for the massless incoming (anti-) fermions and q = ~h — k2 for the outgoing gauge 
boson. The color factor accounts for the number of 5[/(3) states which can be combined to form a color singlet 
like the Z. 

An interesting aspect coming out of our calculation is that the l/m^-dependent terms in the polarization sum do not 
contribute — as far as the matrix element squared is concerned the Z boson could as well be transverse. This reflects 
the fact that the Goldstone modes do not couple to massless fermions, just like the Higgs boson. It also means that the 
corresponding matrix element squared for the photon production simply means not replacing q^ m\ in the last step. 



What is still missing is an averaging factor for initial-state spins and colors, only the sum is included in Eq.(2.4i. For 
incoming electrons as well as incoming quarks this factor Kij includes 1/4 for the spins. Since we do not observe 
color in the initial state, and the color structure of the incoming qq pair has no impact on the Z-production matrix 
element, we also average over the color. This means another factor 1 jN'^ in the averaged matrix element, which 
altogether becomes (in four space-time dimensions) 



(2.5) 



In spite of our specific case in Eq.( 2.4 1 looking that way, matrix elements we compute from our Feynman rules are not 
automatically numbers with mass unit zero. 



If for the partonic invariant mass of the two quarks we introduce the Mandelstam variable s = [h + ^2)^ = 2(/cifc2) 
momentum conservation implies s = q^ = m\. This simple one-particle phase space has only one remaining free 
parameter, the reduced polar angle y = (1 — cos 0)/2 ~ • • • 1. The azimuthal angle (f) plays no role at colliders, 
unless you want to compute gravitational effects on Higgs production at ATLAS and CMS. Any LHC Monte Carlo 
will either random-generate a reference angle for the partonic process or pick one and keep it fixed. The second 
option has at least once lead to considerable confusion and later amusement at the Tevatron, when people noticed that 
the behavior of gauge bosons was dominated by gravity, namely gauge bosons were supposed to only go up or down... 



2. 1 Drell- Yan process 
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In terms of the two parameters y and we can describe the phase space for a one-particle final state. In four 
space-time dimensions (this detail will become important later) we can compute a total cross section from the matrix 



element squared, for example as given in Eq.(2.4 1, as 



da 
dy 



K,, (l-r) IX r 



1 — cos ( 



m% 



y = 



(2.6) 



The mass of the final state appears in r, with r = for a massless photon. It would be replaced to include miy or the 
Higgs mass or the mass of a Kaluza-Klein graviton if needed. 



From LEP we know that such a heavy gauge boson we do not actually observe at colliders. What we should really 
calculate is the production for example of a pair of fermions through an s-channel Z and 7, where the Z might or 
might not be on its mass shell. The matrix element for this process we can derive from the same Feynman rules in 



Eq.(2.1 1, now for an incoming fermion ki, incoming anti-fermion k2, outgoing fermion pi and outgoing anti-fermion 
P2. To make it easy to switch particles between initial and final states, we can define all momenta as incoming, so 
momentum conservation means ki + k2 + Pi + P2 = 0. The additional Mandelstam variables we need to describe this 
(2 — >■ 2) process are t = 
state particles. The (2 



(fci + pi)^ < and u = {ki + ^2)^ < 0, as usually with s + t + u = for massless final 



2) matrix element for the two sets of incoming and outgoing fermions becomes 



+ rVR)u{ki) 



(2.7) 



The coupling to the gauge bosons are £ and r for the incoming quarks and £' and r' for the outgoing leptons. When we 
combine the four different spinors and their momenta correctly the matrix element squared factorizes into twice the 
trace we have computed before. The corresponding picture is two fermion currents interacting with each other through 
a gauge boson. All we have to do is combine the traces properly. If the incoming trace includes the indices fi and 1/ 
and the outgoing trace the indices p and a, the Z bosons link ^ and p as well as i' and a. 

To make the results a little more compact we compute this process for a massless photon instead of the Z boson, i.e. 
for the physical scenario where the initial-state fermions do not have enough energy to excite the intermediate Z 



boson. The specific features of an intermediate massive Z boson we postpone to Section 2. 1 .2 The assumption of a 
massless photon simplifies the couplings to + 



2Q e and the polarization sums to —g^v'- 



|X|2=4iV, (2Q2e2) (2Q'V)^ [k^,k^^ + k^kl^ - {k,k2)gn (-g^p) 



iPiP2)gn i-g.a) 



Q^Q'^e^ [k,pk!^ + k'^k2p ~ (fcifc2).g;] [p1p2. + PiuP'2 ~ {PiP2)gl\ 



16iV, Q2g'2g4 [2(fciPi)(fc2P2) + 2(fciP2)(fc2Pl) - 2{k,k2)ipiP2) - 2(fcifc2)(piP2) +4(fcifc2)(PlP2)] 

32iV, Q2g'2g4 1 [(k,p,){k2P2) + {kiP2){k2Pi)] 



8Nc g^Q'^e* ^ [s^ + 2st + 2t^] 



u 

T 



2„4 



(2.8) 



We can briefly check if this number is indeed positive, using the definition of the Mandelstam variable t for massless 
external particles in terms of the polar angle t = -s ( — 1 + cos 9)/2 — ■ ■ ■ 0: the upper phase space boundary t = 
inserted into the brackets in Eq.(2.8 1 gives [•••] = 1, just as the lower boundary t ~ — s with [•••] = 1 — 2 + 2 = 1. 
For the central value t = — s/2 the minimum value of the brackets is [• • • 1 = 1 — 1 + 0.5 = 0.5. 
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The two-particle phase space integration for massless particles gives us 




t = - (-1 + cos( 



(2.9) 



For our Drell-Yan process we then find the differential cross section in four space-time dimensions, using 

a = e2/(47r) 



da 



1 TT 



1 



i2Q'-(47ra)2 I 1 + 2- + 2 



dt s2 (47r)2 47Vc 



t J' 



t e 

1 + 2- + 2^ 
s s'^ 



(2.10) 



which we can integrate over the polar angle or the Mandelstam variable t to compute the total cross section 
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2t^' 






21" 




^ 3? 
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(2.11) 



As a side remark — in the history of QCD, the same process but read right-to-left played a crucial role, namely the 
production rate of quarks in e+e^ scattering. For small enough energies we can neglect the Z exchange contribution. 
At leading order we can then compute the corresponding production cross sections for muon pairs and for quark pairs 
in e~^e~ collisions. Moving the quarks into the final state means that we do not average of the color in the initial state, 
but sum over all possible color combinations, which in Eq.(2.9 1 gives us an averaging factor Kij — 1/4. Everything 
else stays the same as for the Drell-Yan process 



R = 



^7(e^ 



hadrons) 



i+e-) 



Z^quarks ^Je^Jrj 



47ra2 
3s 



-iVn 3 



2; = 



lliV, 



(2.12) 



for example for five quark flavors where the top quark is too heavy to be produced at the given e+e~ collider energy. 
For those interested in the details we did take one short cut: hadrons are also produced in the hadronic decays of 
e+e^ — > r+r^ which we strictly speaking need to subtract. This way, i? as a function of the collider energy is a 
beautiful measurement of the weak and color charges of the quarks in QCD. 



2.1.2 Massive intermediate states 

Before we move on to describing incoming quarks inside protons we should briefly consider the second Feynman 
diagram contributing to the Drell-Yan production rate in Eq.ljZTTJ, the on-shell or off-shell Z boson 

|X|2 ^ \M^+Mz\^ = \M^\^ + \Mz? + 2ReMzM-, . (2.13) 

Interference occurs in phase space regions where for both intermediate states the invariant masses of the muon pair are 
the same. For the photon the on-shell pole is not a problem. It has zero mass, which means that we hit the pole in 
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the matrix element squared only in the Umit of zero incoming energy. Strictly speaking we never hit it, because the 
energy of the incoming particles has to be large enough to produce the final state particles with their tiny but finite 

masses and with some kind of momentum driving them through the detector. 

A problem arises when we consider the intermediate Z boson. In that case, the propagator contributes as 
\Mf oc — 1^1%)"^ which diverges for on-shell Z bosons. Before we can ask what such a pole means for LHC 
simulations we have to recall how we deal with it in field theory. There, we encounter the same issue when we solve 
for example the Klein-Gordon equation. The Green function for a field obeying this equation is the inverse of the 
Klein-Gordon operator 

(□ + m^) G{x - x') = S'^{x - x') . (2.14) 
Fourier transforming G{x — x') into momentum space we find 

{n + m')G{x-x') = J ^ (□ + m^)e-'«-(---')G(g) 

^e--(— ') (V + m^)G(,) I 5\x-x') = j e-'-^^-^') 

^ {-q^ +m^)G{q) = l ^ G{q) • (2.15) 

The problem with the Green function in momentum space is that as an inverse it is not defined for q^ — m?. We 
usually avoid this problem by slightly shifting this pole following the Feynman ie prescription to m? — ie, or 

equivalently deforming our integration contours appropriately. The sign of this infinitesimal shift we need to 
understand because it will become relevant for phenomenology when we introduce an actual finite decay width of 
intermediate states. 

In the Feynman ie prescription the sign is crucial to correctly complete the go integration of the Fourier transform in 
the complex plane 



q^-m^+ie = ^^^""^ + ^^-""^^ '^^^ gg - {.^ - ie) 

-oo (go - vw"' - iej(go + vw"^ - «e) 



C2 



2 -2 

w = g 



e 



= e{xo) (f) +0{-xo) f dqo- T- TT^YT , — r-^ with e' = 

JcJ [qo - - ie')){qo + uj{l - le')) 2u:^ 



(2.16) 



In the last step we have closed the integration contour along the real go axis in the complex go plane. Because the 
integrand has to vanish for large go, we have to make sure the exponent — zxo ilm go = xolm go is negative. For 
xo > this means Imgo < and vice versa. This argument forces Ci to close for positive and C2 for negative 
imaginary parts in the complex go plane. 

The contour integrals we can solve using Cauchy's formula, keeping in mind that the integrand has two poles at 
go — ±w(l — ie'). They lie in the upper (lower) half plane for negative (positive) real parts of go. The contour Ci 
through the upper half plane includes the pole at go — w while the contour C2 includes the pole at go ^ w, all 
assuming a; > 0: 



J — c 



dqo -IT- = 27ri 

, g^ — + ze 



>0 .71" 

= —I — 



u> + oj{\ — ie') —uj~u{l — ie')_ 

[6»(xo)e-'"^° + 6'(-xo)e*'"^°] . (2.17) 
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2uj 



Oixo) 



2uj 



-Xo) 



2uj 



-xq) 



2uj 



Q{x^) 



Table 1: Contributions to the propagator function Eq. p.l8[ ) for both signs of ie. 



The factor (—1) in the C2 integration arises because Cauchy's integration formula requires us to integrate 
counter-clockwise, while going from negative to positive Rego the contour C2 is defined clockwise. Using this result 



we can complete the four-dimensional Fourier transform from Eq.(2.15 1 

G{x) = J d\ e-'(9-^)G((7) 



— —in / d q e 



q2 _ j^j2 _|_ jg 
1 



[9{xo)t 



0{~xo)e'' 



,„-^ 1 



in / Ae^'^" - [eixo)e-"^'>^'>S{qo - uj) + e{~Xo)e-"i°''"Siqo + uj)] 



= -in / d^g e-*('?-^) - [e(xo)6(uj - go) + 0(-xo)S(uj + go)] with J(x) = (5(-a;) 

= -in J d^q e-'^"-^^ ^ 2w [e{xo)5{uj'' - ql) + 9{-Xo)S{uj^ - gg)] 
= -2ni J Ae-^(«-") [9{xo) + 9{-xo)] S{q^ - oj^) 

^ -2ni J d^q e-'(«-^) [e{xo) + e{-xo)] 5{q^ - m^) with g^ - = g^ - . 

This is exactly the usual decomposition of the propagator function Ap{x) — 6'(xo) A+ (x) + 9{—xo)A^ (x) 
positive and negative energy contributions. 



(2.18) 



into 



Let us briefly recapitulate what would have happened if we instead had chosen the Feynman parameter e < 0. All 
steps leading to the propagator function in Eq.(2.18 1 we summarize in Table[T] For the wrong sign of ie the two poles 
in the complex go plane would be mirrored by the real axis. The solution with Re go > would sit in the quadrant 
with Im go > and the second pole at a negative real and imaginary part. To be able to close the integration path in 
the upper half plane in the mathematically positive direction the real pole would have to be matched up with 9{—xq). 
The residue in the Cauchy integral would now include a factor +1/ (2a;). At the end, the two poles would give the 
same result as for the correct sign of ie, except with a wrong over-all sign. 
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When we are interested in the kinematic distributions of on-shell massive states the situation is a little different. 
Measurements of differential distributions for example at LEP include information on the physical width of the 
decaying particle, which means we cannot simply apply the Feynman ie prescription as if we were dealing with an 
asymptotic stable state. From the same couplings governing the Z decay, the Z propagator receives corrections, for 
example including fermion loops; 



Such one-particle irreducible diagrams can occur in the same propagator repeatedly. Schematically written as a scalar 
they are of the form 



— TTiQ + le q + le 

i 



— ttIq + ie 



q'^ — TOq + le 
i 



r-t-iM') 



— rriQ + le \q mo + 



q^ — ttIq + le 



q^ — ?7i5 + ie 



TOq + ie 



1 - 



summing the geometric series 



— ttIq + ie 



q^ — rriQ + ie — 



(2.19) 



where we denote the loop as AP for reasons which will become obvious later. Requiring that the residue of the 
propagator be unity at the pole we renormalize the wave function and the mass in the corresponding process. For 
example for a massive scalar or gauge boson with a real correction M^(g^) this reads 



iZ 



q^ — mQ — A'P{q'^) q^ — m? 



for q^ ^ 



(2.20) 



including a renormalized mass m and a wave function renormalization constant Z. 



The important step in our argument is that in analogy to the effective ggH coupling discussed in Section 1.4.1 the 
one-loop correction A/^ depends on the momentum flowing through the propagator Above a certain threshold it can 
develop an imaginary part because the momentum flowing through the diagram is large enough to produce on-shell 
states in the loop. Just as for the ggH coupling such absorptive parts appear when a real decay like Z — > £+f ^ 
becomes kinematically allowed. After splitting A-P{q^ ) into its real and imaginary parts we know what to do with the 



real part; the solution to q^ 



KeM'^{q^) = defines the renormalized particle mass q^ — m? and the wave 



function renormaUzation Z. The imaginary part looks like the Feynman ie term discussed before 



ReAf2((j2) _ ilmA'p 



iZ 



q2 _ jyip. 



iZ 

ie - iZlmAP 



Z 



imV 



ImM2(g2 



(2.21) 



for e — >^ and finite T ^ 0. The link between the self energy matrix element squared and the partial width we can 
illustrate remembering one way to compute scalar integrals or one-loop amplitudes by gluing them together using 



tree-level amplitudes. The Cuskosky cutting rule discussed in Section 1.4.1 tells us schematically written 

Im A'P ^ A/^|cut = r because cutting the one-loop bubble diagram at the one possible place is nothing but squaring 

the two tree-level matrix element for the decay Z £^£^ . One thing that we need to keep track of, apart from the 
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additional factor m due to dimensional analysis, is the sign of the imT term which just like the ie prescription is fixed 
by causality. 

Going back to the Drell-Yan process qq — > £^£^ we now know that for massive unstable particles the Feynman 
epsilon which we need to define the Green function for internal states acquires a finite value, proportional to the total 
width of the unstable particle. This definition of a propagator of an unstable particle in the s channel is what we need 
for the second Feynman diagram contributing to the Drell-Yan process: qq ^ Z* ^ £^£^ . The resulting shape of the 
propagator squared is a Breit-Wigner propagator 



a{qq^ Z ^ £+£-) (X 


1 


1 


s — + imT 


(s — + rn^r^ 



(2.22) 



When taking everything into account, the (2 2) production cross section also includes the squared matrix element 
for the decay Z — > £^£^ in the numerator In the narrow width approximation, the (2 — > 2) matrix element factorizes 
into the production process times the branching ratio for Z £^£~, simply by definition of the Breit-Wigner 
distribution 

^To m^i\m^V^ = r„-^5(. - ml) ^ ^ BR(Z ^ ££) S{s ~ m|) . (2.23) 
i ->-u [s — J I '''' i tot ^ tot 

The additional factor tt will be absorbed in the different one-particle and two-particle phase space definitions. We 
immediately see that this narrow width approximation is only exact for scalar particles. It does not keep information 
about the structure of the matrix element, e.g. when a non-trivial structure of the numerator gives us the spin and 
angular correlations between the production and decay processes. 

Eq.( |2.23[ ) uses a mathematical relation we might want to remember for life, and that is the definition of the 
one-dimensional Dirac delta distribution in three ways and including all factors of 2 and tt 

Six) - / ? e-"' = lim e-"'/'^' = lim - , ^ . (2.24) 

The second distribution is a Gaussian and the third one we would refer to as a Breit-Wigner shape while most other 
people call it a Cauchy distribution. 



Now, we know everything necessary to compute all Feynman diagrams contributing to muon pair production at a 
hadron collider. Strictly speaking, the two amplitudes interfere, so we end up with three distinct contributions: 7 
exchange, Z exchange and the 7 — Z interference terms. They have the properties 

- for small energies the 7 contribution dominates and can be linked to R. 

- on the Z pole the rate is regularized by the Z width and Z contribution dominates over the photon. 

- in the tails of the Breit-Wigner distribution we expect Z — 7 interference. 

- for large energies we are again dominated by the photon channel. 

- quantum effects allow unstable particles like the Z to decay off-shell, defining a Breit-Wigner propagator 

- in the limit of vanishing width the Z contribution factorizes into a ■ BR. 



2.1.3 Parton densities 



At the end of Section 2.1.1 the discussion of different energy regimes for R experimentally makes sense — at an 
e+e^ collider we can tune the energy of the initial state. At hadron colliders the situation is very different. The energy 
distribution of incoming quarks as parts of the colliding protons has to be taken into account. If we assume that quarks 
move collinearly with the surrounding proton, i.e. that at the LHC incoming partons have zero px, we can define a 
probability distribution for finding a parton just depending on the respective fraction of the proton's momentum. For 
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this momentum fraction x — ■ ■ -1 the parton density function (pdf) is denoted as fi(x), where i denote the different 
partons in the proton, for our purposes u, d, c, s, g and depending on the details b. All incoming partons we assume to 
be massless. 

In contrast to so-called structure functions a pdf is not an observable. It is a distribution in the mathematical sense, 
which means it has to produce reasonable results when integrated over together with a test function. Different parton 
densities have very different behavior — for the valence quarks (uud) they peak somewhere around x < 1/3, while 
the gluon pdf is small at a; ^ 1 and grows very rapidly towards small x. For some typical part of the relevant 
parameter space {x = IQ^'^ ■ ■ ■ 10^^) it roughly scales like fg{x) oc x^^. Towards smaller x values it becomes even 
steeper. This steep gluon distribution was initially not expected and means that for small enough x LHC processes 
will dominantly be gluon fusion processes. 



While we cannot actually compute parton distribution functions fi{x) as a function of the momentum fraction x there 
are a few predictions we can make based on symmetries and properties of the hadrons. Such arguments for example 
lead to sum rules: 



The parton distributions inside an anti-proton are linked to those inside a proton through CP symmetry, which is an 
exact symmetry of QCD. Therefore, for all partons we know 

f^i^)^M^) fl{^)^h{^) f!i^) = fai^) (2.25) 

for all values of x. 

If the proton consists of three valence quarks uud, plus quantum fluctuations from the vacuum which can either 
involve gluons or quark-antiquark pairs, the contribution from the sea quarks has to be symmetric in quarks and 
antiquarks. The expectation values for the signed numbers of up and down quarks inside a proton have to fulfill 

(iV„) = dx iUx) - Mx)) = 2 (TVrf) = dx {h{x) - fsix)) = 1 . (2.26) 



Similarly, the total momentum of the proton has to consist of sum of all parton momenta. We can write this as the 
expectation value of J2 

= / dxx fg{x) + M^) + fgi^)^ = 1 (2.27) 

What makes this prediction interesting is that we can compute the same sum only taking into account the measured 
quark and antiquark parton densities. We find that the momentum sum rule only comes to 1/2. Half of the proton 
momentum is carried by gluons. 



Given the correct definition and normalization of the pdf we can now compute the hadronic cross section from its 
partonic counterpart, like the QED result in Eq.ljZTTJ, as 




(2.28) 



where i,j are the incoming partons with the momentum factions Xij. The partonic energy of the scattering process is 
s — X1X2S with the LHC proton energy of eventually \/S = 14 TeV. The partonic cross section a corresponds to the 
cross sections a computed for example in Eq.( |2.1 l| l. It has to include all the necessary 6 and 6 functions for 
energy-momentum conservation. When we express a general n-particle cross section a including the phase space 
integration, the Xi integrations and the phase space integrations can of course be interchanged, but Jacobians will 



make life hard when we attempt to get them right. In Section 2.1.5 we will discuss an easier way to compute 
kinematic distributions instead of the fully integrated total rate in Eq.( 2.28| l. 
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2.1.4 Hadron collider kinematics 

Hadron colliders have a particular kinematic feature in that event by event we do not know the longitudinal velocity of 
the initial state, i.e. the relative longitudinal boost from the laboratory frame to the partonic center of mass. This 
sensitivity to longitudinal boosts is reflected in the choice of kinematic variables. The first thing we can do is consider 
the projection of all momenta onto the transverse plane. These transverse components are trivially invariant under 
longitudinal boosts because the two are orthogonal to each other. 

In addition, for the production of a single electroweak gauge boson we remember that the produced particle does not 
have any momentum transverse to the beam direction. This reflects the fact that the incoming quarks are collinear with 
the protons, i.e. they have zero transverse momentum. Such a gauge boson not recoiling against anything else cannot 
develop a finite transverse momentum. Of course, once we decay this gauge boson, for example into a pair of muons, 
each muon will have transverse momentum, only their vector sum will be zero: 



E 

final state 



Pt, 



= 



(2.29) 



This is a relation between two-dimensional, not three dimensional vectors. For several particles in the final state we 
can define an azimuthal angle in the plane transverse to the beam direction. While differences of azimuthal angles, 
e.g. two such differences between three particles in the final state, are observables the over-all angle is a symmetry of 
the detector as well as of our physics. 



In addition to the transverse plane we need to parameterize the longitudinal momenta in a way which makes it easy to 
implement longitudinal boosts. In Eq.( |2.28 i we integrate over the two momentum fractions xi,2 and can at best 
determine their product xiX2 — s/S from the final state kinematics. Our task is to replace both, xi and X2 with more 
physical variables which should be well behaved under longitudinal boosts. 
A longitudinal boost for example from the rest frame of a massive particle reads 
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j odd ■ 

1 cosh y + ^j' g J sinh y 
In the first line we already see that the combination of two such exponentials involving yi and y2 is additive 



/ cosh y 
\ sinh y 



(2.30) 



y = yi + 2/2- The rapidity y as defined in Eq.(2.30l we can re-write in a way which allows us to compute it from the 
four-momentum for example in the LHC lab frame 



1, E+pl 1, cosh w + sinh y 1, 

- log = - log '- = - log 

2 ^E-pL 2 ^coshy-sinhy 2 ^ e-v 



y 



(2.31) 



We can explicitly check that the rapidity is indeed additive by applying a second longitudinal boost to {E,pl) in 
Eq.([230| 
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1 cosh y' 
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1 



sinh y' 



E cosh y' + PL sinh y' 
PL cosh y' + E sinh y' 



which gives for the combined rapidity, following its extraction in Eq.( |2.31[ ) 
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2 log 
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(2.32) 



(2.33) 
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This combination of several longitudinal boosts is important in the case of massless particles. They do not have a rest 
frame, which means we can only boost them from one finite-momentum frame to the other For such massless 
particles we can simplify the formula for the rapidity Eq.( 2.31[ ), instead expressing the rapidity in terms of the polar 
angle 9. We use the fact that (only) for massless particles E = \p\ 



1 , E 



PL 1, \P\ 
PL 2 \p\ 



PL 1, 1 

— = o log r 

PL 2 1 



cos ( 



= - log 
2 ^ 



= - log tan-^f] 



(2.34) 



This pseudo-rapidity ry is more handy, but coincides with the actual rapidity only for massless particles. To get an idea 
about the experimental setup at the LHC — in CMS and ATLAS we can observe all particles very well to polar angles 
of 5 to 10 degrees, corresponding to pseudo-rapidities of 2.4 to 3.1. Because this is about the same range as the range 
of the azimuthal angle [0, tt] we define a distance measure inside the detector 



(Ai?)2 = {Ayf + (A<^)2 
= {Arjf + (A0)2 

9 + A9\^ 

1^ 



massless particles 



tan 




(A0)2 + O((A(?)3) 



(2.35) 



The angle 9 is the polar angle of one of the two particles considered and in our leading approximation can be chosen 



as each of them without changing Eq.(2.35 i 



Still studying the production of a single gauge boson at the LHC we can express the final state kinematics in terms of 
two parameters, the invariant mass of the final-state particle and its rapidity. The transverse momentum we aheady 
know is zero. The two incoming and approximately massless protons have the momenta 

Pi = (S,0,0,S) P2^{E,0,0,-E) S={2Ef, (2.36) 

which for the momentum of the final-state gauge boson in terms of the parton momentum fractions means, when we 



include the definition of the rapidity in Eq.(2.30 1 
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{231) 



This result can be combined with x\X2 — q^ / S to obtain 



Xl = 



P 



X2 



S 



(2.38) 



These relations allow us to for example compute the hadronic total cross section for lepton pair production in QED 

(2.39) 
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instead in terms of the hadronic phase space variables xi,2 in terms of the kinematic final state observables and y. 
Remember that the partonic or quark-antiquark cross section a is already integrated over the (symmetric) azimuthal 
angle (p and the polar angle Mandelstam variable t. The transverse momentum of the two leptons is therefore fixed by 
momentum conservation. 
The Jacobian for this change of variables reads 



djq^.v) 

d{xi,X2) 



which inserted into Eq.(2.39 1 gives us 
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(2.40) 



ina Q, f , 2 , X1X2 1 



<?2 g2 



j dq^dy ^ Q] Xifj{xi)x2fj{x2) 



(2.41) 



In contrast to the original form of the integration over the hadronic phase space this form reflects the kinematic 
observables. For the Drell-Yan process at leading order the q^ distribution is the same as mj^, one of the most 
interesting distributions to study because of different contributions from the photon, the Z boson, or extra dimensional 
gravitons. On the other hand, the rapidity integral still suffers from the fact that at hadron colliders we do not know the 
longitudinal kinematics of the initial state and therefore have to integrate over it. 



2.1.5 Phase space integration 

In the previous example we have computed the simple two dimensional distribution, by leaving out the double 
integration in Eq.( 2.41| l 
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dq^dy 
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This expression we can numerically evaluate and compare to experiment. However, the rapidity y and the momentum 
transfer q^ are by no means the only distribution we would like to look at. Moreover, over the parton densities f{x) 
we will have to integrate numerically, so we will have to rely on numerical integration tools no matter what we are 
doing. Looking at a simple (2 2) process we can write the total cross section as 



Ctot 



j d4> Jdcose Jdxi J dx2 Fps\M\^ = dyi---dy4Jps{y)\M\'' , (2.43) 



with an appropriate function Fps. In the second step we have re-written the phase space integral as an integral over the 
four-dimensional unit cube, with the appropriate Jacobian. Like any integral we can numerically evaluate this phase 
space integral by binning the variable we integrate over 

'dyfiy) Y.^Ay),f{y,)^AyJ2f{y,)- (2-44) 

j 







Without any loss of generality we assume that the integration boundaries are • • • 1. The integration variable y we can 
divide into a discrete set of points yj, for example equi -distant in y and by choosing a chain of random numbers 
yje[0, 1]. In the latter case we need to keep track of the bin widths (Ay)^. When we extend the integral over N 
dimensions we can in principle divide each axis into bins and compute the functional values for this grid. For not 
equidistant bins generated by random numbers we again keep track of the associated phase space volume for each 
random number vector. However, once we know these phase space weights for each phase space point there is no 
reason to consider the set of random numbers as in any way linked to the N axes. All we need is a chain of random 
points with an associated phase space weight and their transition matrix element, to integrate over the phase space in 



complete analogy to Eq.(2.44i 



2. 1 Drell- Yan process 
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The obvious question arises how such random numbers can be chosen in a smart way; but before we discuss how to 
best evaluate such an integral numerically, let us first illustrate how this integral is much more useful than just 
providing the total cross section. If we are interested in the distribution of an observable, like for example the 
distribution of the transverse momentum of a muon in the Drell- Yan process, we need to compute da/dpx as a 



function of pT- In terms of Eq.(2.43 i any physical yi distribution is given by 



dyi--- dyN f{y) = / dy 



da 
dyi 



da 
dyi 



= / dy2--- dyN /(y?) = / dyi--- dyN f{y) S{yi - y?) 



(2.45) 
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Numerically we can compute this distribution in two ways: one way corresponds to the first line in Eq.( |2.45[ ) and 
means numerically evaluating the y2 - - - yN integrations and leaving out the yi integration. The result will be a 
function of yi which we then evaluate at different points y". 



The second and much more efficient option corresponds to the second line of Eq.(2.45 1, with the delta distribution 



defined for discretized yi. First, we define an array with the size given by the number of bins in the yi integration. 
Then, for each yi value of the complete yi - - - yN integration we decide where the value yi goes in this array and add 
f{y) to the corresponding column. Finally, we print these columns as a function of yi to see the distribution. This set 
of columns is referred to as a histogram and can be produced using publicly available routines. This histogram 
approach does not sound like much, but imagine we want to compute a distribution da/dpT, where pt(j7) is a 
complicated function of the integration variables and kinematic phase space cuts. We then simply evaluate 



da 
dpT 



dyi--- dyN f{y) S (priy) - Pt) 



(2.46) 



numerically and read off the px distribution as a side product of the calculation of the total rate. Histograms mean that 
computing a total cross section numerically we can trivially extract all distributions in the same process. 



The procedure outlined above has an interesting interpretation. Imagine we do the entire phase space integration 
numerically. Just like computing the interesting observables we can compute the momenta of all external particles. 
These momenta are not all independent, because of energy-momentum conservation, but this can be taken care of. 
The tool which translates the vector of integration variables y into the external momenta is called a 
phase space generator. Because the phase space is not uniquely defined in terms of the integration variables, the phase 
space generator also returns the Jacobian Jps, called the phase space weight. If we think of the integration as an 
integration over the unit cube, this weight needs to be combined with the matrix element squared Once we 

compute the unique phase space configuration (fci, k2,pi - - - )j corresponding to the vector yj, the combined weight 
W — Jps |A^|^ is the probability that this configuration will appear at the LHC. This means we do not only integrate 
over the phase space, we really simulate LHC events. The only complication is that the probability of a certain 
configuration is not only given by the frequency with which it appears, but also by the explicit weight. So when we 
run our numerical integration through the phase space generator and histogram all the distributions we are interested 
in we generate weighted events. These events, i.e. the momenta of all external particles and the weight W, we can for 
example store in a big file. 

This simulation is not yet what experimentalists want — they want to represent the probability of a certain 
configuration appearing only by its frequency. Experimentally measured events do not come with a variable weight, 
either they are recorded or they are not. This means we have to unweight the events by translating the event weight 
into frequency. 

There are two ways to do that. On the one hand, we can look at the minimum event weight and express all other events 
in relative probability to this event. Translating this relative event weight into a frequency means replacing an event 
with the relative weight Wj/W^^n by Wj/W^i„ events in the same phase space point. The problem with this method 
is that we are really dealing with a binned phase space, so we would not know how to distribute these events in the 
given bin. Alternatively, we can start from to the maximum weight Wmax and compute the ratio Wj7Wn,axe[0, 1]. 
Keeping an event with a given probability means we can generate a flat random number re[0, 1] and only keep it if 
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Wj/W^ax > r. The challenge in this translation is that we always lose events. If it was not for the experimentalists we 
would hardly use such unweighted events, but they have good reasons to want such unweighted events which feed best 
through detector simulations. 

The last comment is that if the phase space configuration (fci , fc2, pi • • • )j can be measured, its weight Wj better be 
positive. This is not trivial once we go beyond leading order There, we need to add several contributions to produce a 
physical event, like for example different n-particle final states, and there is no guarantee for each of them to be 
positive. We have to ensure that after adding up all contributions and after integrating over any kind of unphysical 
degrees of freedom we might have introduced, the probability of a physics configuration is positive. From this point of 
view negative values for parton densities /(x) < are in principle not problematic, as long as we always keep a 
positive hadronic rate dcr^p^x > 0. 

Going back to the numerical phase space integration for many particles, it faces two problems. First, the partonic 
phase space for n on-shell particles in the final state has 3(n + 2) — 3 dimensions. If we divide each of these 
directions in 100 bins, the number of phase space points we need to evaluate for a (2 — > 4) process is 100^^ — 10^°, 
which is not realistic. 

To integrate over a large number of dimensions we use Monte Carlo integration. In this approach we define a 
distribution py {y) such that for a one-dimensional integral we can replace the binned discretized phase space integral 
by a discretized version based on a set of random numbers Yj over the integration variables y 

{9{Y)) ^ j\y py{v) g{y) j^llaiyj)- (2A1) 

All we have to make sure is that the probability of returning Yj is given by py (y) for y <Yj < y + dy. As mentioned 



above, also Eq.( 2.47 1 has the advantage that we can naively generalize it to any number of N dimensions, just by 
organizing the random numbers Yj in one large chain instead of an A^-dimensional array. Our iV-dimensional phase 
space integral shown in Eq.( |2.43| l we can re-write in the same manner 

['^"y^.pyiyi-im.) -^IVm^ (2.48) 







Py(2/) ' ' \py{Y)/ N^py{Y, 







To compute the integral we have to average over all phase space values of f /py. In the ideal case where we exactly 
know the form of the integrand and can map it into our random numbers, the error of the numerical integration will be 
zero. So what we have to find is a way to encode f{Yj) into pY{Yj). This task is called importance sampling and you 
can find some documentation for example on the standard implementation VEGAS to look at the details. 
Technically, VEGAS will call the function which computes the weight W — Jps for a number of phase space 
points and average over these points, but including another weight factor Wmc representing the importance sampling. 
If we want to extract distributions via histograms we have to add the total weight W — Wmc^^ps to the columns. 

The second numerical challenge is that the matrix elements for interesting processes are by no means flat. We would 
therefore like to help our adaptive or importance sampling Monte Carlo by defining the integration variables such that 
the integrand becomes as flat as possible. For example for the integration over the partonic momentum fraction we 
know that the integrand usual falls off as 1/x. In that situation we can substitute 

dx^^f d\ogx (^^) ' [ dlogxC, (2.49) 

X J log (5 \ dx J X JlogS 

to obtain a flat integrand. There exists an even more impressive and relevant example: intermediate particles with 
Breit-Wigner propagators squared are particularly painful to integrate over the momentum s = p^ flowing through it 

^(^' = 7 2/2^ 2r2 ■ ^2.50) 

For example, a Standard-Model Higgs boson with a mass of 120 GeV has a width around 0.005 GeV, which means 
that the integration over the invariant mass of the Higgs decay products ^/s requires a relative resolution of 10^^. 



2.2 Ultraviolet divergences 



63 



Since this is unlikely to be achievable what we should really do is find a substitution which produces the inverse 
Breit-Wigner as a Jacobian and leads to a flat integrand — et voila 



/ 



C f , ^dz\ ' C 

ds ^t-t; — - = dz 



(s — rn^)^ + m^T^ J \ds J {s — m^)^ + im?T^ 

(s-m2)2 + m2r2 C 



dz 



1 



mT (s — rn^)^ + m^r^ 

^2 



dz C with tanz — . (2.51) 

mT J niT 

This is the most useful phase space mapping in LHC physics. Of course, any adaptive Monte Carlo will eventually 
converge on such an integrand, but a well-chosen set of integration parameters will speed up simulations very 
significantly. 

2.2 Ultraviolet divergences 

From general field theory we know that when we are interested for example in cross section prediction with higher 
precision we need to compute further terms in its perturbative series in ag- This computation will lead to ultraviolet 
divergences which can be absorbed into counter terms for any parameter in the Lagrangian. The crucial feature is that 
for a renormalizable theory like our Standard Model including a Higgs boson the number of counter terms is finite, 
which means once we know all parameters including their counter terms our theory becomes predictive. 
In Section [23] we will see that in QCD processes we also encounter another kind of divergences. They arise from the 
infrared momentum regime. Infrared divergences is what this lecture is really going to be about, but before dealing 
with them it is very instructive to see what happens to the much better understood ultraviolet divergences. In 
Section 2.2.1 we will review how such ultraviolet divergences arise and how they are removed. In Section 2.2.2 we 



will review how running parameters appear in this procedure, i.e. how scale dependence is linked to the appearance of 
divergences. Finally, in Section [2.2.3| we will interpret the use of running parameters physically and see that in 
perturbation theory they re-sum classes of logarithms to all orders in perturbation theory. Later in Section [23] we will 
follow exactly the same steps for infrared divergences and develop some crucial features of hadron collider physics. 

2.2.1 Counter terms 

Renormalization, i.e. the proper treatment of ultraviolet divergences, is one of the most important things to 
understand about field theories; more detailed discussions you can find in any book on advanced field theory. The 
aspect of renormalization which will guide us through this section is the appearance of the renormalization scale. 
In perturbation theory, scales automatically arise from the regularization of infrared or ultraviolet divergences. We can 
see this writing down a simple scalar loop integral, corresponding to two virtual scalars with masses mi 2 and with the 



external momentum p flowing through a diagram similar to those summed in Section 2.1.2 



B(p2;mi,TO2)= / 21—^2 2 • (2.52) 

Such two-point functions appear for example in the gluon self energy, with massless scalars for ghosts, with a Dirac 
trace in the numerator for quarks, and with massive scalars for supersymmetric scalar quarks. In those cases the two 
masses are identical m\ = m-i. The integration measure l/(167r2) is dictated by the usual Feynman rules for the 



integration over loop momenta. Counting powers of q in Eq.(2.52 1 we see that the integrand scales like l/g in the 
ultraviolet, so it is logarithmically divergent and we have to regularize it. Regularizing means expressing the 
divergence in a well-defined manner or scheme allowing us to get rid of it by renormalization. 

One regularization scheme is to introduce a cutoff into the momentum integral A, for example through the so-called 
Pauli-Villars regularization. Because the ultraviolet behavior of the integrand or integral cannot depend on any 



parameter living at a small energy scales, the parameterization of the ultraviolet divergence in Eq.(2.52 1 cannot 
involve the mass m or the external momentum p^. The scalar two-point function has mass dimension zero, so its 
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divergence has to be proportional to log(A//ifl;) with a dimensionless prefactor and some scale which is an artifact 
of the regularization of such a Feynman diagram. 



A more elegant regularization scheme is dimensional regularization. It is designed not to break gauge invariance and 
naively seems to not introduce a mass scale /i/j. When we shift the momentum integration from 4 to 4 — 2e 
dimensions and use analytic continuation in the number of space-time dimensions to renormalize the theory a 
renormalization scale /i^ nevertheless appears once we ensure the two-point function and with it observables like 
cross sections keep their correct mass dimension 



167r2 



(47r)2 



(2.53) 



At the end, the scale fj, might become irrelevant and drop out after renormalization and analytic continuation, but to 
be on the save side we keep it. The constants Ci in the series in 1 /e depend on the loop integral we are considering. To 
regularize the ultraviolet divergence we assume e > and find mathematically well defined poles 1 /e. Defining scalar 
integrals with the integration measure l/{iiT^) will make for example C_i come out as of the order 0(1). This is the 
reason we usually find factors l/(47r)2 = tt^/ (27r)^ in front of the loop integrals. 

The poles in 1/e will cancel with the universal counter terms once we renormalize the theory. Counter terms we 
include by shifting parameters in the Lagrangian and the leading order matrix element. They cancel the poles in the 
combined leading order and virtual one-loop prediction 
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The dots indicate higher orders in g^, fo r example absorbing the 5g corrections in the leading order and virtual 
interference. As we can see in Eq.(2.54 1 the counter terms do not come with a factor ji]^ in front. Therefore, while the 
poles 1/e cancel just fine, the scale factor /i^*^ will not be matched between the actual ultraviolet divergence and the 
counter term. 

We can keep track of the renormalization scale best by expanding the prefactor of the regularized but not yet 
renormalized integral in Eq.( |2.53| l in a Taylor series in e, no question asked about convergence radii 



+ Co + 0{e) 



= [l + 2elog>K + 0(e2 



+ Co + O{e) 



e 



Co + C_i log ^il + 0{e) 



Co + C_i log 



/4 
M2 



0[e) 



(2.55) 



In the last step we have by hand corrected for the fact that log with a mass dimension inside the logarithm cannot 
appear in our calculations. From somewhere else in our calculation the scale dependence logarithm will be matched 
with a log where is the typical mass or energy scale in our process. This little argument shows that also in 
dimensional regularization we introduce a mass scale /i^f which appears as log(/z|j./Af 2) in the renormalized 
expression for our observables. There is no way of removing ultraviolet divergences without introducing the 
renormalization scale if we keep track of the mass dimensions of our regularized result. 

In Eq.( |2.55| l there appear two contributions to a given observable, the expected Cq and the renormalization induced 
C_i. Because the factors C_i are linked to the counter terms in the theory we can often guess them without actually 
computing the loop integral, which is very useful in cases where they numerically dominate. 



Counter terms as they schematically appear in Eq.(2.54 1 are not uniquely defined. They need to include a given 
divergence, to return finite observables, but we are free to add any finite contribution we want. This opens many ways 
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to define a counter term for example based on physical processes where counter terms do not only cancel the pole but 
also finite contributions at a given order in perturbation theory. Needless to say, such schemes do not automatically 
work universally. An example for such a physical renormalization scheme is the on-shell scheme for masses, where 
we define a counter term such that external on-shell particles do not receive any corrections to their masses. For the 



top mass this means that just like in Eq.(2.54 1 we replace the leading order mass with the bare mass, for which we 



then insert the expression in terms of the renormalized mass and the counter term 

hare _ _|_ 



+ ^ (3 (- i + 7. - log(4vr) - log § ) - 4 + 3 log ^ 



The convenient scale dependent pole 1/e includes the universal additional terms like the Euler gamma function and 
the scaling logarithm. This logarithm is the big problem in this universality argument, since we need to introduce the 
arbitrary energy scale M to separate the universal logarithm of the renormalization scale and the parameter-dependent 
logarithm of the physical process. 

A theoretical problem with this on-shell renormalization scheme is that it is not gauge invariant. On the other hand, it 
describes for example the kinematic features of top pair production at hadron colliders in a stable perturbation series. 
This means that once we define a more appropriate scheme for heavy particle masses in collider production 
mechanisms it better be numerically close to the pole mass. For the computation of total cross sections at hadron 
colliders or the production thresholds at e+e^ colliders the pole mass is not well suited at all, but as we will see in 
Section[3]this is not where at the LHC we expect to measure particle masses, so we should do fine with something 
very similar to the pole mass. 

Another example for a process dependent renormalization scheme is the mixing of 7 and Z propagators. There we 
choose the counter term of the weak mixing angle such that an on-shell Z boson cannot oscillate into a photon, and 
vice versa. We can generaUze this scheme for mixing scalars as they for example appear in supersymmetry, but it is 
not gauge invariant with respect to the weak gauge symmetries of the Standard Model either. For QCD corrections, on 
the other hand, it is the most convenient scheme keeping all exchange symmetries of the two scalars. 

To finalize this discussion of process dependent mass renormalization we quote the result for a scalar supersymmetric 
quark, a squark, where in the on-shell scheme we find 
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with r — m'^/nii. The interesting aspect of this squark mass counter term is that it also depends on the gluino mass, 
not just the squark mass itself. The reason why QCD counter terms tend to depend only on the renormalized quantity 
itself is that the gluon is massless. In the limit of vanishing gluino contribution the squark mass counter term is again 
only proportional to the squark mass itself 
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One common feature of all mass counter terms listed above is 6m oc to, which means that we actually encounter a 
multiplicative renormalization 

m^"^" = Zmm={l + SZrn) TO = I 1 + — ) TO = TO + 5to , (2.59) 
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with dZm — 5m /m linking the two ways of writing the mass counter term. This form implies that particles with zero 
mass will not obtain a finite mass through renormalization. If we remember that chiral symmetry protects a 
Lagrangian from acquiring fermion masses this means that on-shell renormalization does not break this symmetry. A 
massless theory cannot become massive by mass renormalization. Regularization and renormalization schemes which 
do not break symmetries of the Lagrangian are ideal. 

When we introduce counter terms in general field theory we usually choose a slightly more model independent 
scheme — we define a renormalization point. This is the energy scale at which the counter terms cancels all higher 
order contributions, divergent as well as finite. The best known example is the electric charge which we renormalize in 
the Thomson limit of zero momentum transfer through the photon propagator 

e ^ e''"'= ^e + 5e. (2.60) 



Looking back at 5mt as defined in Eq.( 2.56 1 we also see a way to define a completely general counter term: if 
dimensional regularization, i.e. the introduction of 4 — 2e dimensions does not break any of the symmetries of our 
Lagrangian, like Lorentz symmetry or gauge symmetries, we should simply subtract the ultraviolet pole and nothing 
else. The only question is: do we subtract 1/e (MS scheme) or do we subtract 1/e in the MS scheme . In the MS 
scheme the counter term is then scale dependent. 

Carefully counting, there are three scales present in such a scheme. First, there is the physical scale in the process. In 
our case of a top self energy this is for example the top mass mt appearing in the matrix element for the process 
pp — > ti. Next, there is the renormalization scale /z^, a reference scale which is part of the definition of any counter 
term. And last but not least, there is the scale M separating the counter term from the process dependent result, which 
we can choose however we want, but which as we will see implies a running of the counter term. The role of this scale 
M will become clear when we go through the example of the running strong coupling as. Of course, we would prefer 
to choose all three scales the same, but in a complex physical process this might not always be possible. For example, 
any massive (2 3) production process naturally involves several external physical scales. 

Just a side remark for completeness: a one loop integral which has no intrinsic mass scale is the two-point function 
with zero mass in the loop and zero momentum flowing through the integral: B{p^ ~ 0; 0, 0). It appears for example 
in the self energy corrections of external quarks and gluons. Based on dimensional arguments this integral has to 
vanish altogether. On the other hand, we know that like any massive two-point function it has to be ultraviolet 
divergent B ^ 1/euv because setting all internal and external mass scales to zero is nothing special from an 
ultraviolet point of view. This can only work if the scalar integral also has an infrared divergence appearing in 
dimensional regularization. We can then write the entire massless two-point function as 

d^q I 1 _ ivr^ / 1 1 
16^ '^{q+pY ~ 16^ W " 



i?b2 = 0;0,0)= / ^ = _ ( — --) , (2.61) 



keeping track of the divergent contributions from the infrared and the ultraviolet regimes. For this particular integral 
they precisely cancel, so the result for -8(0; 0, 0) is zero, but setting it to zero too early will spoil any ultraviolet and 
infrared finiteness tests in practical calculations. Treating the two divergences strictly separately and dealing with 
them one after the other also ensures that for ultraviolet divergences we can choose e > while for infrared 
divergences we require e < 0. 

2.2.2 Running strong coupling 

To get an idea what these different scales which appear in the process of renormalization mean let us compute such a 
scale dependent parameter, namely the running strong coupling as(/i^). The Drell-Yan process is one of the very few 
relevant process at hadron colliders where the strong coupling does not appear at tree level, so we cannot use it as our 
toy process this time. Another simple process where we can study this coupling is bottom pair production at the LHC, 
where at some energy range we will be dominated by valence quarks: qq bb. The only Feynman diagram is an 
s-channel off-shell gluon with a momentum flow p^ = s. 
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At next-to-leading order this gluon propagator will be corrected by self energy loops, where the gluon splits into two 
quarks or gluons and re-combines before it produces the two final state bottoms. Let us for now assume that all quarks 
are massless. The Feynman diagrams for the gluon self energy include a quark look, a gluon loop, and the ghost loop 
which removes the unphysical degrees of freedom of the gluon inside the loop. 




The gluon self energy correction or vacuum polarization, as propagator corrections to gauge bosons are usually 
labelled, will be a scalar, i.e. all fermion lines close and the Dirac trace is computed inside the loop. In color space the 
self energy will (hopefully) be diagonal, just like the gluon propagator itself, so we can ignore the color indices for 
now. In unitary gauge the gluon propagator is proportional to the transverse tensor T^'' = g^"^ ~ p'^p'^/p^. As 
mentioned in the context of the effective gluon-Higgs coupling, the same should be true for the gluon self energy, 
which we therefore write as = HT^''. A useful simple relation is T'^'TP = T^p and T^"'gP = T^^p. Including 
the gluon, quark, and ghost loops the regularized gluon self energy with a momentum flow p^ reads 
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The minus sign arises from the factors i in the propagators, as shown in Eq.( 2. 19 1. The number of fermions coupling 
to the gluons is n/. From the comments on B{p^; 0, 0) we could guess that the loop integrals will only give a 
logarithm \ogp^ which is then matched by the logarithm log Af ^ implicitly included in the definition of e. The factor 
bo arises from the one-loop corrections to the gluon self energy, i.e. from diagrams which include one additional 
factor as- Strictly speaking, this form is the first term in a perturbative series in the strong coupling as — g1/ (47r). 
Later on, we will indicate where additional higher order corrections would enter 



In the second step of Eq.(2.62 1 we have sneaked in additional contributions to the renormalization of the strong 



coupling from the other one-loop diagrams in the process, replacing the factor 13/6 by a factor 1 1/3. This is related to 
the fact that there are actually three types of divergent virtual-gluon diagrams in the physical process qq — > bb: the 

1/2 

external quark self energies with renormalization factors Zj , the internal gluon self energy Za, and the vertex 
corrections ZAff- The only physical parameters we can renormalize in this process are the strong coupling and, if 
finite, the bottom mass. Wave function renormalization constants are not physical. The entire divergence in our 
qq — > bb process which needs to be absorbed in the strong coupling Zg is given by the combination 
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We can check this definition of Zg by comparing all vertices in which the strong coupling gs appears, namely the 
gluon coupling to quarks, ghosts as well as the triple and quartic gluon vertex. All of them need to have the same 
divergence structure 

ZaU ! ZArm ! ! / 
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If we had done the same calculation in QED, i.e. looked for a running electric charge, we would have found that the 
vacuum polarization diagrams for the photon do account for the entire counter term of the electric charge. The other 
two renormalization constants Zj^ff and Zf cancel because of gauge invariance. 



In contrast to QED, the strong coupling diverges in the Thomson limit because QCD is confined towards large 
distances and weakly coupled at small distan ces. L acking a well-enough motivated reference point we are tempted to 
renormalize in the MS scheme. From Eq.(2.62 1 we know that the ultraviolet pole which needs to be cancelled by 
the counter term is proportional to the function 60 
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Only in the last step we have explicitly included the scale dependence of the counter term. Because the bare coupling 
does not depend on any scales, this means that ag depends on the unphysical scale M. Similar to the top mass 
renormalization scheme we can switch to a more physical scheme for the strong coupling as well: we can absorb also 
the finite contributions of n(/^^/p^) into the strong coupling by simply identifying AP — p^. This implies based 
again on Eq.(2.62 1 
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This formula defines a running coupling as{p^), because the definition of the coupling now has to account for a 
poss ible sh ift between the original argument and the scale AP coming out of the MS scheme. Since according to 
Eqs.(2.65 1 and (2.66 1 the bare strong coupUng can be expressed in terms of as{AP) as well as in terms of asip^) we 



can link the two scales through 



d\og p^ 



M2 



(2.67) 



To the given loop order the argument of the strong coupling squared in this formula can be neglected — its effect is of 
higher order. 



In this first formula for the running coupling constant we see that &o is positive in the Standard Model. This means that 
forp2 > AP the ultraviolet limit of the strong coupling is zero. This makes QCD an asymptotically free theory. We 
can compute the function 60 in general models by simply adding all contributions of strongly interacting particles in 
this loop 

1 



127r 



(2.68) 



colored states 



where we need to know some kind of counting factor Dj which is -1 1 for a vector boson (gluon), +4 for a Dirac 
fermion (quark), +2 for a Majorana fermion (gluino), +1 for a complex scalar (squark) and +1/2 for a real scalar. The 
color charges are Tr = 1/2 for the fundamental representation of 5[/(3) and Ca = Nc for the adjoint representation. 
The masses of the loop particles are not relevant in this approximation because we are only interested in the ultraviolet 
regime of QCD where all particles can be regarded massless. When we really model the running of we need to take 
into account threshold effects of heavy particles, because particles can only contribute to the running of ctg at scales 



above their mass scale. This is why the R ratio computed in Eq.( 2. 12 1 is so interesting once we vary the energy of the 
incoming electron-positron pair. 
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We can do even better than this fixed order in perturbation theory: while the correction to ag in Eq.(2.66 1 is 
perturbatively suppressed by the usual factor Q;s/(47r) it includes a logarithm of a ratio of scales which does not need 
to be small. Instead of simply including these gluon self energy corrections at a given order in perturbation theory we 
can instead include chains of one-loop diagrams with 11 appearing many times in the off-shell gluon propagator This 



series of Feynman diagrams is identical to the one we sum for the mass renormalization in Eq.( 2. 19 1. It means we 
replace the off-shell gluon propagator by (schematically written without the factors i) 
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(2.69) 



To avoid indices we abbreviate T^T^ = T • T which make sense because of (T • T • = Tt'PT^T^ = T'"'. This 
re-summation of the logarithm which appears in the next-to-leading order corrections to as moves the finite shift in as 
shown in Eqs.( |2.62} and ( |2.66| l into the denominator, while we assume that the pole will be properly taken care off in 
any of the schemes we discuss 
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(2.70) 



Just as in the case without re-summation, we can use this complete formula to relate the values of a, at two reference 
points, i.e. we consider it a renormalization group equation (RGE) which evolves physical parameters from one scale 
to another in analogy to the fixed order version in Eq.( 2.67| i 
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(2.71) 



The factor as inside the parentheses we can again evaluate at either of the two scales, the difference is going to be a 
higher order effect. When we differentiate as{p^) with respect to the momentum transfer p2 we find, using the 
relation d/dx{l/as) — — l/o^s das/dx 



1 da. 



as dlogp2 



1 



'dlogp2 cts 



da. 



, da^ 



dp^ d log p2 



n=0 



(2.72) 



This is the famous running of the strong coupling constant including all higher order terms &„ 



It is customary to replace the renormalization point of as in Eq.( |2.70| l with a reference scale where the denominator of 
the re-summed running coupling crosses zero and the running coupling itself should diverge. This is the Landau pole 



of the strong coupling, as discussed for the Higgs self coupling in Section 1.2.3 At one loop order this reads 
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(2.73) 
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bo log 



p^ 



A2 



This scheme can be generalized to any order in perturbative QCD and is not that different from the Thomson limit 
renormalization scheme of QED, except that with the introduction of Aqcd we are choosing a reference point which is 
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particularly hard to compute perturbatively. One thing that is interesting in the way we introduce Aqcd is the fact that 
we introduce a scale into our theory without ever setting it. All we did was renormalize a coupling which becomes 
strong at large energies and search for the mass scale of this strong interaction. This trick is called 
dimensional transmutation. 



In terms of language, there is a little bit of confusion between field theorists and phenomenologists: up to now we 
have introduced the renormalization scale fi^j as the renormalization point, for example of the strong coupling 
constant. In the MS scheme, the subtraction of 1/e shifts the scale dependence of the strong coupling to AP and 
moves the logarithm log A/^/Aq^q into the definition of the renormalized parameter. This is what we will from now 
on call the renormalization scale in the phenomenological sense, i.e. the argument we evaluate at. Throughout this 
section we will keep the symbol M for this renormalization scale in the MS scheme, but from Section 
shift back to /i/j instead of M as the argument of the running coupling. 



2.3 



on we will 



2.2.3 Re-summing scaling logarithms 



In the last Section 2.2.2 we have introduced the running strong coupling in a fairly abstract manner For example. 



re-summing diagrams and changing the running of as from Eq.(2.67 1 to Eq.(2.72 1 we did not yet link to any physics. 
In what way does the re-summation of the one-loop diagrams for the s channel gluon improve our prediction of the 
bottom pair production rate at the LHC? 



To illustrate those effects we best look at a simple observable which depends on just one energy scale, namely p^. The 
first observable coming to mind is again the Drell-Yan cross section (j{qq — > but since we are not really sure 

what to do with the parton densities which are included in the actual hadronic observable, we better use an observable 
at an e+e^ collider. Something that will work and includes as at least in the one-loop corrections is the R parameter 
given in Eq.( |2.12| l 
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(2.74) 



The numerical value at leading order assumes five quarks. Including higher order corrections we can express the result 
in a power series in the renormalized strong coupling as- In the MS scheme we subtract l/e{^ji/M) and in general 
include a scale dependence on M in the individual prefactors r„ 
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(2.75) 



The r„ we can assume to be dimensionless — if they are not, we can scale R appropriately using p^. This implies that 
the r„ only depend on ratios of two scales, the externally fixed on the one hand and the artificial AP on the other. 



At the same time we know that R is an observable, which means that including all orders in perturbation theory it 
cannot depend on any artificial scale choices AI. Writing this dependence as a total derivative and setting it to zero we 
find an equation which would be called a Callan-Symanzik equation if instead of the running coupling we had 
included a running mass 
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(2.76) 
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In the second line we have to remember that the M dependence of as is already included in the appearance of /3, so 
as should be considered a variable by itself. This perturbative series in as has to vanish in each order of perturbation 
theory. The non-trivial structure, namely the mix of r„ derivatives and the perturbative terms in the /3 function we can 
read off the term in Eq.(2.76 1: first, we have the appropriate NNNLO corrections r^^. Next, we have one loop in the 



gluon propagator bo and two loops for example in the vertex r2. And finally, we need the two-loop diagram for the 



gluon propagator bi and a one-loop vertex correction ri. The kind-of Callan-Symanzik equation Eq.(2.76 1 requires 
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(2.77) 



The dependence on the argument A/^ vanishes for rp and ri . Keeping in mind that there will be integration constants 
c„ independent of AP and that another, in our simple case unique momentum scale has to cancel the mass units 
inside log we find 
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(2.78) 



This chain of r„ values looks like we should interpret the apparent fixed-order perturbative series for R in Eq.(2.75 1 as 



a series which implicitly includes terms of the order log"^ AP/p^ in each r„. They can become problematic if this 
logarithm becomes large enough to spoil the fast convergence in terms of as ^ 0.1, evaluating the observable R at 
scales far away from the scale choice for the strong coupling constant M. 



Instead of the series in r„ we can use the conditions in Eq.( 2.78| l to express R in terms of the c„ and collect the 
logarithms appearing with each c„. The geometric series we then resum to 
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(2.79) 



In the last step we use what we know about the running coupling from Eq.( 2.71| i. Note that in contrast to the r„ 
integration constants the c„ are by definition independent of /AP and therefore more suitable as a perturbative 
series in the presence of potentially large logarithms. 
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This re-organization of the perturbation series for R can be interpreted as re-summing all logarithms of the kind 
logM^/p^ in the new organization of the perturbative series and absorbing them into the running strong coupling 
evaluated at the scale p^. In this manner, all scale dependence in the perturbative series for the dimensionless 
observable R is moved into as- In Eq.( |2.79[ ) we also see that this series in c„ will never lead to a scale-invariant result 
when we include a finite order in perturbation theory. 

Before moving on we collect the logic of the argument given in this section: when we regularize an ultraviolet 
divergence we automatically introduce a reference scale fij^. Naively, this could be a ultraviolet cutoff scale, but even 
the seemingly scale invariant dimensional regularization in the conformal limit of our field theory cannot avoid the 
introduction of a scale. There are several ways of dealing with such a scale: first, we can renormalize our parameter at 
a reference point. Secondly, we can define a running parameter, i.e. absorb the scale logarithm into the MS counter 
term. In that case introducing Aqcd leaves us with a compact form of the running coupling Q!s(M^; Aqcd)- 
Strictly speaking, at each order in perturbation theory the scale dependence should vanish together with the ultraviolet 
poles, as long as there is only one scale affecting a given observable. However, defining the running strong coupling 
we sum one-loop vacuum polarization graphs. Even when we compute an observable at a given loop order, we 
implicitly include higher order contributions. They lead to a dependence of our perturbative result on the artificial 
scale AP, which phenomenologists refer to as renormalization scale dependence. 

Using the R ratio we see what our definition of the running coupling means in terms of re-summing logarithms: 
reorganizing our perturbative series to get rid of the ultraviolet divergence as{p^) re-sums the scale logarithms 
logp^/M^ to all orders in perturbation theory. We will need this picture once we introduce infrared divergences in the 
following section. 



2.3 Infrared divergences 



After this brief excursion into ultraviolet divergences and renormalization we can return to the original example, the 
Drell-Yan process. Last, we wrote down the hadronic cross sections in terms of parton distributions at leading order in 



Eq.(2.39 1. At this stage particle distributions (pdfs) are only functions of the collinear momentum fraction of the 
partons inside the proton about which from a theory point of view we only know a set of sum rules. 
The perturbative question we need to ask for fi^fJ,^ production at the LHC is: what happens if together with the two 
leptons we produce additional jets which for one reason or another we do not observe in the detector. Such jets could 
for example come from the radiation of a gluon from the initial state quarks. In Section[2.3.1|we will study the 



kinematics of radiating such jets and specify the infrared divergences this leads to. In Sections 2.3.2 and 2.3.3 we will 



show that these divergences have a generic structure and can be absorbed into a re-definition of the parton densities. 



similar to an ultraviolet renormalization of a Lagrangian parameter In Section 2.3.4 and 2.3.5 we will again follow the 



example of the ultraviolet divergences and specify what absorbing these divergences means in terms logarithms 
appearing in QCD calculations. 

Throughout this writeup we will use the terms jets and final state partons synonymously. This is not really correct 



once we include jet algorithms and hadronization. On the other hand, in Section 3.2.2 we will see that the purpose of a 
jet algorithm is to take us from some kind of energy deposition in the calorimeter to the parton radiated in the hard 
process. The two should therefore be closely related. 



2.3.1 Single jet radiation 

Let us get back to the radiation of additional partons in the Drell-Yan process. We can start for example by computing 
the cross section for the partonic process qq — > Zg. However, this partonic process involves renormalization of 
ultraviolet divergences as well as loop diagrams which we have to include before we can say anything reasonable, i.e. 
ultraviolet and infrared finite. 

To make life easier and still learn about the structure of collinear infrared divergences we instead look at the crossed 
process qg -> Zq. It should behave similar to any other (2 — > 2) jet radiation, except that it has a different incoming 
state than the leading-order Drell-Yan process and hence does not involve virtual corrections. This means we do not 
have to deal with ultraviolet divergences and renormalization, and can concentrate on parton or jet radiation from the 
initial state. Moreover, let us go back to Z production instead of a photon, to avoid confusion with massless particles 
which are not radiated jets. 
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The amplitude for this (2 2) process is — modulo charges and averaging factors, but including all Mandelstam 
variables 

+ . (2.80) 

s st 

The Mandelstam variable t for one massless final state particle can be expressed as i = — s(l — t)j/ in terms of the 
rescaled gluon-emission angle y = (1 — cos0)/2 and r = m\/ s. Similarly, we obtain u = — s(l — t)(1 — y), so as a 
first check we can confirm that t + u = — s(l — t) — —s + m\. The collinear limit when the gluon is radiated in the 
beam direction is given by y — > 0, corresponding to negative t — > with finite u — —s + m\. In this limit the matrix 
element can also be written as 

lA^I ^^^-+0{y). (2.81) 



This expression is divergent for collinear gluon radiation or splitting, i.e. for small angles y. We can translate this 1 /y 
divergence for example into the transverse momentum of the gluon or Z 

spl =tu = s^l - t)2 y{l -y)^{s~ mlfy + Oiy^) (2.82) 

In the collinear limit our matrix element squared in Eq.( 2.81[ ) becomes 



,.,,2 — 2sm?, + 2™! (s — On 



The matrix element for the tree-level process qg — > Zq has a leading divergence proportional to To compute the 

total cross section for this process we need to integrate the matrix element over the entire two-particle phase space. 



Starting from Eq.(2.41 1 and using the appropriate Jacobian this integration can be written in terms of the reduced 
angle y. Approximating the matrix element as C"/y or C /p^, we then integrate 



dy— = / dp^j,— = 2 / dpTPT^^2C dpT— = 2C log^ (2.84) 



ynm, y Jp:mn Pj, J ^mn Pj, J ^mm Pt Pj, 

The form C/p^ for the matrix element is of course only valid in the collinear limit; in the non-collinear phase space C 
is not a constant. However, Eq.( |2.84| i describes well the collinear divergence arising from quark radiation at the LHC. 

Next, we follow the same strategy as for the ultraviolet divergence. First, we regularize the divergence for example 
using dimensional regularization. Then, we find a well-defined way to get rid of it. Dimensional regularization means 
writing the two-particle phase space in n = 4 — 2e dimensions. Just for reference, the complete formula in terms of 
the angular variable y reads 

In the second step we only keep the factors we are interested in. The additional factor l/y'^ regularizes the integral at 
y — ?> 0, as long as e < by slightly increasing the suppression of the integrand in the infrared regime. After 
integrating the leading collinear divergence l/y^^' we are left with a pole l/(— e). This regularization procedure is 
symmetric in y o (1 — y). What is important to notice is again the appearance of a scale with the n-dimensional 
integral. This scale arises from the infrared regularization of the phase space integral and is referred to as 
factorization scale . The actual removal of the infrared pole — corresponding to the renormalization in the ultraviolet 
case — is called mass factorization and works exactly the same way as renormalizing a parameter: in a well-defined 
scheme we simply subtract the pole from the fixed-order matrix element squared. 



2.3.2 Parton splitting 

From the discussion of the process qg — > Zq we can at least hope that after taking care of all other infrared and 
ultraviolet divergences the collinear structure of the process qq Zg will be similar In this section we will show that 
we can indeed write all collinear divergences in a universal form, independent of the hard process which we choose as 
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Figure 9: Splitting of one gluon into two gluons. Figure from Ref. Il24ll . 



the Drell-Yan process. In the collinear limit, the radiation of additional partons or the splitting into additional partons 
will be described by universal splitting functions. 



Infrared divergences occur for massless particles in the initial or final state, so we need to go through all ways 
incoming or outgoing gluons and quark can split into each other The description of the factorized phase space, with 
which we will start, is common to all these different channels. The first and at the LHC most important case is the 
splitting of one gluon into two, shown in Figure|9j where the two daughter gluons are close to mass shell while the 
mother has to have a finite positive invariant mass pl,p^. We again assign the direction of the momenta as 
Pa — —pb — Pc, which means we have to take care of minus signs in the particle energies. The kinematics of this 
approximately collinear process we can describe in terms of the energy fractions z and 1 — z defined as 

z = = 1 - M {-Pb-Pc? - '2{PbPc) = 2z(i -z){i- cose)El = z{i z)Ele^ + o{e^) 



^ e = eb + e,-r^ J . , (2.86) 

\Ea\ y Z[l - Z) 

in the collinear limit and in terms of the opening angle between pb and pc- Because p^ > we call this final state 
splitting configuration time-like branching. For this configuration we can write down the so-called 
Sudakov decomposition of the four-momenta 

-Pa ^Pb+Pc^ {-zpa + /3n + pt) + (-(1 - z)pa - l3n - pt) ■ (2.87) 

It defines an arbitrary unit four-vector n, a component orthogonal to the mother momentum and n, i.e. pa 

(PaPr) = = [npT], and a free factor /3. This way, we can specify n such that it defines the direction of the Pb-Pc 

decay plane. In this decomposition we can set only one invariant mass to zero, for example that of a radiated gluon 

pI = 0. The second final state will have a finite invariant mass p^ ^ 0. 

Relative to pa we can split the opening angle 6 for massless partons according to Figure|9] 

e = e, + e. and J = ^f^V' = — ^ = A = - (2.88) 



Using this specific phase space parameterization we can divide an {n + l)-particle process into an n-particle process 
and a splitting process of quarks and gluons. First, this requires us to split the (n + l)-particle phase space alone into 
an n-particle phase space and the collinear splitting. The general (n + 1) -particle phase space separating off the 
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n-paiticle contribution 
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(2.89) 



is best expressed in terms of the energy fraction z and the azimuthal angle (/). 

In other words, separating the {n + l)-particle space into an n-particle phase space and a (1 — >^ 2) spHtting phase 
space is possible without any approximation, and all we have to take care of is the correct prefactors in the new 
parameterization. The third direction of p^ we can translate into 2; in a convenient reference frame for the momenta 
appearing in the Sudakov decomposition 
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(2.90) 



This choice h as the special feature that — which allows us to derive /3 from the momentum parameterization 
shown in Eq.(2.87 1 and the additional condition that p^ = 



pI = (-(1 - z)pa ~ I3n- Pt) 

= {l-zfpl+p^T + W{l-z){npa) 
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(2.91) 



Starting from Eq.(2.87 1 for the z component of pc, expressing Pa and pT following Eq.( |2.90 l and inserting this value 
for /3 gives us 
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(2.92) 



In addition to substituting dpc,3 by dz in Eq.(2.89 1 we also replace dpT with dp^ according to 
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This gives us the final result for the separated collinear phase space 
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where in the second step we assume a spherical symmetry. 
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Adding the transition matrix elements to this factorization of the phase space and ignoring the initial-state flux factor 
which is common to both processes we can now postulate a full factorization for one collinear emission and in the 
coUinear approximation 
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(2.95) 



This last step is an assumption we will now proceed to show step by step by constr ucting the appropriate 
splitting kernels P{z) for all different quark and gluon configurations. If Eq.(2.95 i holds true it means we can 
compute the (n + 1) particle amplitude squared from the n-particle case convoluted with the appropriate splitting 
kernel. Using ddn ^ \MnV d^n and gl = Anas we can write this relation in its most common form 



dpi 

pI 



dz — P(z) 



(2.96) 



Reminding ourselves that relations of the kind |A^„+i p = p|A^„ P can typically be summed, for example for the case 
of successive soft photon radiation in QED, we see that Eq.( |2.96[ l is not the final answer. It does not include the 
necessary phase space factor 1/n! from identical bosons in the final state which leads to the simple exponentiation. 

As the first parton splitting in QCD we study a gluon splitting into two gluons, shown in Figure |9] To compute its 
transition amplitude we parameterize all gluon momenta and polarizations. With respect to the scattering plane 
opened by pf, and pc all three gluons have two transverse polarizations, one in the plane, e" , and one perpendicular to 
it, In the limit of small scattering angles, the three parallel as well as the three perpendicular polarization vectors 
are aligned. The perpendicular polarizations are also orthogonal to all three gluon momenta, while the polarizations in 
the plane are only proportional to their corresponding momenta, which altogether means for the three-vectors 



= -1 + 0{e) [eief) = I [4,]) = 0{e) 



{eip,)=Q (ek-) = 0, 



(2.97) 



with general i ^ j and exactly one and zero for i — j- The finite combinations between polarization vectors and 
momenta which we need are, in terms of the 3-dimensional opening angles /(e'jp) 

i.^lPb) = --BbCosZ(e|,pb) = -Ei,cos (| - 6*,,) = +£;6sin6'b ~ +£1,61, = z{l - z)Ea9 

(elpc) - -E,cosZ{4,Pc) - -E.cos (| + ^) = -E,sm9 ^ ^EJ = -(1 - z)Ea0 

{elpb) = -EbC0fiZ{4,pb) = -Ebcos (| - ^) = +EbSm9 ~ +Ef,e = zEad . (2.98) 



Using these kinematic relations we can tackle the actual splitting amplitude. For three gluons the splitting ampUtude 
will be proportional to the vertex, now switching back to the symmetric definition of all incoming momenta 



Vggg = WT'"' C^l^Z bafiiPa ~ Pb)^ + gfijiPb ~ Pc)a + g-raiPc - Pa)/?] 
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with Pa = -pb - Pc 
with {f-jPj) — 
with {eaPc) = -(eaPb) 
with {eipj) = . (2.99) 
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Squaring the splitting matrix element to compute the (n + 1) and n particle matrix elements squared for the 
unpolarized case gives us 
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(2.100) 



where the sum originally includes the three terms in the brackets of Eq.( 2.99 1. Each term in this sum is symmetric in 
two indices but gets multiplied with the anti-symmetric color factor. The final result will only be finite if we square 
each term individually as a product of two symmetric and two anti-symmetric terms. In other words, the sum over the 
external gluons becomes an incoherent polarization sum. 



Going through all possible combinations we know what can contribute inside the brackets of Eq.(2.99 1: (eHej,) as well 



as (e^e^) can be combined with (ecPb)', i^l^c) or (cj^Cc ) with (ellpb); and last but not least we can combine (eHec) 
and e^) with (efpc)- These six combinations contribute to the spUtting matrix element as 





£6 




±{e-e){e-p) 


pI 


pY 


II 

_L 


II 

_L 


II 
II 


i-l){-z)Ea0 


1 


z 

— z 


II 
II 


II 

_L 


II 

_L 


-{-l){-z){l-z)Ea9 


z(l 


-z) 


II 

_L 


II 
II 


II 

_L 


- z)Ea9 


1 


— z 
z 



These six cases correspond to four different polarizations of the three external gluons. For the coherent sum in 

25.' 1 \^{^-^)\^-pf 



Eq.(2.100i we find 
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with f^bcj^abd ^ j^^gcd 

with Na = 2 
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(z) - Ca 



1 - z 



+ z{l-z) 



(2.101) 



using Ca = and averaging over the color {N^ — 1) and polarization Na of the mother particle a. The factor 1 /2 in 
the first line takes into account that for two final state gluons the (n + l)-particle phase space is only half its usual 
size. The form of the splitting kernel is symmetric when we exchange the two gluons z and (1 — z). It diverges if 
either of the gluons become soft. The notation Pi^j ^ Pij is inspired by a matrix notation which we can use to 
multiply the splitting matrix from the right with the incoming parton vector to get the final parton vector. Following 
the logic described above, with this calculation we prove that the factorized form of the {n + l)-particle matrix 



element squared in Eq.(2.95 1 holds for gluons only. 



The same kind of splitting kernel we can compute for the spUtting of a gluon into two quarks and the splitting of a 
quark into a quark and a gluon 



giPa) q{pb) + q{Pc) 



and 



q{pa) q{pb) + g{pc) 



(2.102) 
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Both splittings include the quark-quark-gluon vertex, coupling the gluon current to the quark and antiquark spinors. 
The spinors of the massless quark u{pi,) and the massless antiquark v{pc) we can write in terms of two-component 
spinors 

^^P^^^{±X^ with X+ = (^0/2) (spin up) 

X- = (^"^^^^ (spin down). (2.103) 

For the massless antiquark we need to replace 9 — > —9 and take into account the different relative spin-momentum 
directions (crp), leading to the additional sign in the lower two spinor entries. The antiquark spinors then become 

v{p) ^ -i^/E (^^^^^^ with X+=(^_l/2) '^^+"(-1^) (spin up) 

ex- = ( i/o) (spin down). (2.104) 



1 J \-0/2 

Our calculations we again limit to the leading terms in the small scattering angle 9. In addition to the fermion spinors, 
for the coupling to a gluonic current we need the Dirac matrices which we can conveniently express in terms of the 
Pauli matrices defined in Section [TX2] 



In the notation introduced in Eq.(2.101 1 we first compute the splitting kernel Pq^g, sandwiching the qqg vertex 
between an outgoing quark u± (pb) and an outgoing antiquark v± (pa) for all possible spin combinations. We start 
with all four gluon polarizations, i.e. all four gamma matrices, between two spin-up quarks and their spinors written 
out in Eqs. p.l03| ) and ( |2.104| l 
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(2.106) 



Somewhat surprisingly the unphysical scalar and longitudinal gluon polarizations seem to contribute to this vertex. 
However, after adding the two unphysical degrees of freedom they cancel because of the form of our metric. For 
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transverse gluons we can compute this vertex factor also for the other diagonal spin combination 
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(2.107) 



Before collecting the prefactors for this gluon-quark splitting, we also need the same-spin case 
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, (2.108) 



which vanishes. The gluon current can only couple to two fermions via a spin flip. For massless fermions this means 
that the gluon splitting into two quarks involves two quark spin cases, each of them coupling to two transverse gluon 
polarizations. Keeping track of all the relevant factors our vertex function for the splitting g ^ qq becomes for each of 
the two quark spins 
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(2.109) 



omitting irrelevant factors i and (—1) which will drop out once we compute the absolute value squared. In complete 
analogy to the gluon splitting case we can factorize the (n + l)-particle matrix element into 
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Tr T^T" 1 
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IF^P + li^yT + l-p'+T + l^- 
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with Tr T«T^ = TrS"'' and iV„ = 2 



(2.110) 



with Tr = 1/2. In the first line we implicitly assume that the internal quark propagator can be written as something 
Uke uu/p1 and we only need to consider the denominator. This splitting kernel is again symmetric in z and {1 — z) 
because QCD does not distinguish between the outgoing quark and the outgoing antiquark. 
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The third spHtting we compute is gluon radiation off a quark, i.e. q{pa) q{pb) + 9{Pc), sandwiching the qqg vertex 
between an outgoing quark u± {pb) and an incoming quark u± (pa)- From the spHtting of a gluon into a 
quark-antiquark pair we already know that we can limit our analysis to the physical gluon polarizations and a spin flip 
in the quarks. Inserting the spinors from Eq.(2.103 1 and the two relevant gamma matrices gives us 
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(2.111) 



with the angles 61 and 0* relative to the final state gluon direction pc- Comparing to the situation shown in Figure |9] 
for the angle relative to the scattered gluon we find 91 = 9 while for the incoming quark 6** ~ ~9c — —z9. As 
expected, the spin-down case gives the same result, modulo a complex conjugation 
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(2.112) 



The vertex function for gluon radiation off a quark then becomes 
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again dropping irrelevant pref actors. The factorized matrix element for this channel reads 
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(2.114) 
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The averaging factor 1/Na ~ 2 now is the number of quark spins in the intermediate state. Just switching 
z (1 — z) we can read off the kernel for a quark splitting written in terms of the final state gluon 



1 + 



(2.115) 



This result finalizes our calculation of all QCD splitting kernels Pi^j{z) between quarks and gluons . As alluded to 
earlier, similar to ultraviolet divergences which get removed by counter terms these splitting kernels are universal, i.e. 
they do not depend on the h ard n-p ar ticle rn at rix elem ent which i s part of the original (n + l)-particle process. The 
four results we show in Eqs.(2.101 1, (2.110 1, (2.114 1, and (2.115 i. This means that by construction of the kernels P 
we have shown that the collinear factorization Eq.( 2.96| ) holds at this level in perturbation theory. 



Before using this splitting property to describe QCD effects at the LHC we need to look at the splitting of partons in 
the initial state, meaning \,p^ <^ \pl \ where pi, is the momentum entering the hard interaction. The difference to 
the final state splitting is that now we can consider the split parton momentum Pb ^ Pa — Pc as a f-channel diagram, 
so we already know pi — t < Q from our usual Mandelstam variables argument. This space-like splitting version of 



Eq.( 2.87 1 we can solve for p^ 
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using Eq. p.91| l 
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(2.116) 



The calculation of the splitting kernels and matrix elements is the same as for the time-like case, with the one 

exception that for splitting in the initial state the flow factor has to be evaluated at the reduced partonic energy 

Eb = zEa and that the energy fraction entering the parton density needs to be replaced by xi, — > ZX},. The factorized 



matrix element for initial state splitting then reads just like Eq.(2.96 1 



Cn+l 



^dz ^ P{z) 



(2.117) 



How to use this property to make statements about the quark and gluon content in the proton will be the focus of the 
next section. 



2.3.3 DGLAP equation 



What we now know about collinear parton splitting we can use to describe incoming partons. For example in — > Z 
production incoming partons inside the protons transform into each other via collinear splitting until they enter the Z 



production process as quarks. Taking Eq.( 2. 1 17 1 seriously the parton density we insert into Eq.( 2.28 1 then depends on 
two parameters, the final energy fraction and the virtuality /(a;„, — i„) where the second parameter t is new compared 



to the purely probabilistic picture in Eq.(2.28 i. It cannot be neglected unless we convince ourselves that it is 
unphysical. As we will see later it corresponds exactly to the artificial renormalization scale which appears when we 
re-sum the scaling logarithms which appear in counter terms. 



More quantitatively, we start with a quark inside the proton with an energy fraction xq, as it enters the hadronic phase 



space integral shown in Section 2.1.4 Since this quark is confined inside the proton it can only have small transverse 
momentum, which means its four- momentum squared is negative and its absolute value |to| is small. The variable t 
we call virtuality because for the incoming partons which if on-shell have p'^ = it gives the distance to the mass 
shell. Let us simplify our kinematic argument by assuming that there exists only one splitting, namely successive 
gluon radiation q ^ qg off an incoming quark, where the outgoing gluons are not relevant 
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ItJ 



Figure 10: Path of an incoming parton in the {x — t) plane. Because we define i as a negative number its axis is labelled 
\t\. 



{xo,to) {xi,ti) 




In that case each collinear gluon radiation will decrease the quark energy Xj+i < Xj and through recoil increase its 
virtuality \tj+i\ — —tj+i > —tj — \tj\. 

From the last section we know what the successive splitting means in terms of splitting probabilities. We can describe 
how the parton density f{x, —t) evolves in the {x — t) plane as depicted in Figure 10 The starting point {xq, to) is at 



least probabilistically given by the energy and kind of the hadron, for example the proton. For a given small virtuality 
|io| we start at some kind of fixed xq distribution. We then interpret each branching as a step strictly downward in 
Xj — > Xjj^i where the t value we assign to this step is the ever increasing virtuality |tj+i | after the branching. Each 
splitting means a synchronous shift in x and t, so the actual path in the [x — t) plane really consists of discrete points. 
The probability of such a splitting to occur is given by Pqj^q{z) = P{z) as it appears in Eq.(2.117 1 



^ P{z) ^ dz . 



(2.118) 



In this picture we consider this probability a smooth function in t and z. At the end of the path we will probe this 
evolved parton density, where Xn and i„ enter the hard scattering process and its energy-momentum conservation. 



When we convert a partonic into a hadronic cross section numerically we need to specify the probabihty of the parton 
density f{x, —t) residing in an infinitesimal square [xj , Xj + Sx] and, if this second parameter has anything to do with 
physics, [\tj\, \tj \ + 6t]. Using our {x, t) plane we compute the flow into this square and out of this square which 
together defines the net shift in / in the sense of a differential equation, similar to the derivation of Gauss' theorem for 
vector fields inside a surface 

Sfin-Sf,,,^Sfix,-t) . (2.119) 
The incoming and outgoing flow we compute from the history of the (x, t) evolution. At this stage our picture 



becomes a little subtle; the way we define the path between two splittings in Figure 10 it can enter and leave the 
square either vertically or horizontally, but we have to decide which we choose. If we define a splitting as a vertical 
drop in x at the target value tjj^i an incoming path hitting the square at some value t can come from any x value above 



the square. Using this convention and following the fat solid lines in Figure 10 the vertical flow into (and out of) the 
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square (x, t) square is proportional to 5t 

5U~t) = 5t • 
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z 2tt ^ \z' J 

assuming f{x', —t) = for a;' > 1 



(2.120) 



where 5t is the size of the interval covered by the virtuality value t. We use the definition of a convolution 

X \ dx2 

/O Jo ^1 " \XlJ Jo ^2 

The outgoing flow we define in complete analogy, again leaving the infinitesimal square vertically. FoUowing the fat 
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g{x2). (2.121) 



solid line in Figure 10 the outgoing flow is also proportional to St 



St I dy- 
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2TTt 
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(2.122) 



The y integration, unlike the z integration for the incoming flow is not a convolution. This integration appears because 
we do not know the normalization of P{z) distribution which we interpret as a probability. The reason why it is not a 
convolution is that for the outgoing flow we know the starting condition and integrate over the final configurations; 



this aspect will become important later. Combining Eq.( 2. 120 1 and Eq.( 2. 122 1 we can compute the change in the 
parton density of the quarks as 
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(2.123) 



again assuming f{x) = for x > 1, strictly speaking requiring as to only depend on t but not on z, and using the 
specifically defined plus subtraction scheme 



F{z)+ = F{z)-S{l-z) I dyF{y) 
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(2.124) 



For the second term we choose F{z) = 1/(1 — z), multiply it with an arbitrary test function f{z) and integrate over z. 
In contrast to the original z integral the plus-subtracted integral is by definition finite in the limit z — 1 where some of 
the splitting kernels diverge. For example the quark splitting kernel including the plus prescription becomes of this 
form Cf((1 + z^)/(1 — z))+. At this stage the plus prescription is simply a convenient way of writing a complicated 
combination of splitting kernels, but we will see that it also has a physics meaning. 

We can check that the plus prescription indeed acts as a regularization technique for the parton densities. Obviously, 
the integral over /(2)/(l — z) is divergent at the boundary z — 1, which we know we can cure using 
dimensional regularization. The special case /(z) = 1 illustrates how dimensional regularization of infrared 
divergences in the phase space integration Eq. p.85 1 works 
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(2.125) 
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for 4 + 2e dimensions. This change in sign avoids the analytic continuation of the usual value n 4 — 2e to e < 0. 
The dimensionally regularized integral we can write as 
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(2.126) 



The dimensionally regularized integral minus the pole, i.e. the finite part of the dimensionally regularized integral, is 
the same as the plus-subtracted integral modulo terms of the order e. The third line in Eq.( |2.126l ) shows that the 
difference between a dimensionally regularized splitting kernel and a plus-subtracted splitting kernel manifests itself 
as terms proportional to (5(1 — z). Physically, they represent contributions to a soft-radiation phase space integral. 



Before we move on introducing a gluon density we can slightly reformulate the splitting kernel Pq^q in Eq.( 2. 1 14 1. If 
the plus prescription regularizes the pole at z — > 1, what happens when we include the numerator of the regularized 
function, e.g. the quark splitting kernel, as compared to leaving it out? The finite difference between these results is 
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We can therefore write the quark's splitting kernel in two equivalent ways 
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(2.128) 



The infinitesimal version of Eq.( 2. 123 i is the Dokshitzer-Gribov-Lipatov-Altarelli-Parisi or DGLAP 
integro-differential equation which describes the scale dependence of the quark parton density. As we already know 
quarks do not only appear inq ^ q splitting, but also in gluon splitting. Therefore, we generalize Eq.( |2.123| to 
include the full set of QCD partons, i.e. quarks and gluons. This generalization involves a sum over all allowed 
splittings and the plus-subtracted splitting kernels. For the quark density on the left hand side it is 
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Going back to Eq.( 2.123] l we add all relevant parton indices and splittings and arrive at 
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(2.130) 



2.3 Infrared divergences 



85 



Of the three terms on the right hand side the first and the third together define the plus-subtracted spUtting kernel 
Pq^q{z), just following the argument above. The second term is a proper convolution and the only term proportional 
to the gluon parton density. Quarks can be produced in gluon splitting but cannot vanish into it. Therefore, we have to 
identify the second term with Pq^g in Eq.(2.129 1 without adding a plus-regulator 



Pq^g{z) = Pq^g{z) - Tr [Z^ + {1-Z) 



(2.131) 



In principle, the spUtting kernel Pg^q also generates a quark, in addition to the final state gluon. However, comparing 
this to the terms proportional to Pq^q they both arise from the same splitting, namely a quark density leaving the 
infinitesimal square in the {x — t) plane via the splitting q — > qg. Including the additional Pg^q{y) would be double 
counting and should not appear, as the notation g ^ q already suggests. 

The second QCD parton density we have to study is the gluon density. The incoming contribution to the infinitesimal 
square is given by the sum of four splitting scenarios each leading to a gluon with virtuality — tj+i 
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(2.132) 



using Pg^g{l — z) = Pg^g{z). To Icavc the volume element in the [x, t) space a gluon can either split into two 
gluons or radiate one of n/ light-quark flavors. Combining the incoming and outgoing flows we find 
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(2.133) 



Again, for the gluon density there exists a contribution to 5fia proportional to fq or fq which is not matched by the 
outgoing flow. On the other hand, from the quark case we already know how to deal with it, namely by defining 
without a second plus subtraction 
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(2.134) 



This ensures that the off-diagonal contribution to the gluon density is taken into account following Eq.( |2.129|l. 
Three regularized splitting kernels entering the DGLAP equation we give in Eq.(2.128 1, Eq.(2.131 1, and Eq. (2.1341 



The generic structure of the DGLAP equation implies that the two off-diagonal splitting kernels do not include any 



plus prescription Pi^ 



^P, 



We could have expected this because these off -diagonal kernels are finite in the soft 



limit z 0, 1. Applying a plus prescription would only have modified the splitting kernels at the isolated 
(zero-measure) point y = 1 which for a finite value of the integrand does not affect the integral on the right hand side 
of the DGLAP equation. 



The final splitting kernel Pg^g from the diagonal relation on the right hand side of Eq.(2.133 i requires some more 
work. The y integral not involving the gluon parton densities we can explicitly carry out for the gluon splitting into a 
quark pair 
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The second y integral for gluon radiation has to consist of a finite term and a term we can use to define the plus 
prescription for Pg^g 
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The contribution proportional to 5(1 — z) is necessary to give a finite result and is absorbed into the plus prescription 
of the convoluted Pg^g{z), including the factor two in front. This defines the last remaining regularized splitting 
kernel 



Pg^g{z) = 2Ca 



(2.137) 



and concludes our computation of all four regularized splitting functions which appear in the DGLAP equation 



Eq.(2.129i. 



Before discussing and solving the DGLAP equation, let us briefly recapitulate: for the full quark and gluon particle 
content of QCD we have derived the DGLAP equation which describes a (so-called factorization) scale dependence of 
the quark and gluon parton densities. The universality of the splitting kernels is obvious from the way we derive them 
— no information on the ?i-particle process ever enters the derivation. 

The DGLAP equation is formulated in terms of four splitting kernels of gluons and quarks which are linked to the 
splitting probabilities, but which for the DGLAP equation have to be regularized. With the help of a plus-subtraction 
the kernels Pi^j{z) are all finite, including in the soft limit z 1. However, splitting kernels are only regularized 
when needed, so the finite off -diagonal quark-gluon and gluon-quark splittings are unchanged. This means the plus 
prescription really acts as an infrared renormalization, moving universal infrared divergences into the definition of the 
parton densities. The original collinear divergence has vanished as well. 

The only approximation we make in the computation of the splitting kernels is that in the y integrals we implicitly 
assume that the running coupling as does not depend on the momentum fraction. In its standard form and in terms of 
the factorization scale = —t the DGLAP equation reads 
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2.3.4 Parton densities 



Solving the integro-differential DGLAP equation Eq. p.l38| ) for the parton densities is clearly beyond the scope of 
this writeup. Nevertheless, we will sketch how we would approach this. This will give us some information on the 
structure of its solutions which we need to understand the physics of the DGLAP equation. 

One simplification we can make in this illustration is to postulate eigenvalues in parton space and solve the equation 

for them. This gets rid of the sum over partons on the right hand side. One such parton density is the 

non-singlet parton density, defined as the difference of two parton densities f^^ = (/„ — /a). Since gluons cannot 
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distinguish between quarks and antiquaries, the gluon contribution to their evolution cancels, at least in the massless 
limit. This will be true at arbitrary loop order, since flavor SU{Z) commutes with the QCD gauge group. The 
corresponding DGLAP equation with leading order splitting kernels now reads 



d\og 



^ dz as 



(2.139) 



To solve it we need some kind of transformation which disentangles the convolution, namely a Mellin transform . 
Starting from a function f{x) of a real variable x we define the Mellin transform into moment space m 



M[f]{m)= I dxx'^-'fix) 
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(2.140) 



The integration contour for the inverse transformation lies to the right of all singularities of the analytic continuation 
of A4 [/] (to), which fixes the offset c. The Mellin transform of a convolution is the product of the two Mellin 
transforms, which gives us the transformed DGLAP equation 



dM[ff]im,fip 



dz 

Z ^ ^ \Z 



(to) - M[P,^,]{7n) M[ff][m,iiF) 



dlog y?p 



(2.141) 



and its solution 
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defining 7(to) = Ai[P]{m). Instead of assuming a fixed Ug in the transformed DGLAP equation Eq.(2.141 1 

iti( 

ji. This allows us to replace log fi^ in the DGLAP equation by a^, 



include as{^i\) in the running of the DGLAP equation, identifying the renormalization scale /i/^ of the strong 



coupling with the factorization scale ^ip - 
including the leading order Jacobian 

d d log Us 



1 



da„ 



-ctsbo 



dlogfj,'^ dlogfi'^ dlogas as dlogfi'^ dlogas dloga 
This additional factor of on the left hand side will cancel the factor as on the right hand side of the DGLAP 
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equation Eq.(2.141 
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Among other things in this derivation we neglect that some splitting functions have singularities and therefore the 
Mellin transform is not obviously well de fined. O ur convolution is not really a convolution either, because we cut it 
off at Qg etc; but the final structure in Eq.(2. 144 1 really holds. 
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Because we will need it in the next section we emphasize that the same kind of solution appears in pure Yang-Mills 
theory, i.e. in QCD without quarks. Looking at the different color factors in QCD this limit can also be derived as the 
leading terms in Nc- In that case there exists also only one splitting kernel defining an anomalous dimension 7, and 



we find in complete analogy to Eq.(2.144i 
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To remind ourselves that in this derivation we unify the renormaUzation and factorization scales we denote them just 
as /i. This solution to the DGLAP equation is not completely determined: as a solution to a differential equation it 
also includes an integration constant which we express in terms of /ig. The DGLAP equation therefore does not 
determine parton densities, it only describes their evolution from one scale to another, just like a renormalization 
group equation in the ultraviolet. 



The structure of Eq.( 2. 145 1 already shows something we will in more detail discuss in the following Section 2.3.5 the 
splitting probability we find in the exponent. To make sense of such a structure we remind ourselves that such ratios of 
as values to some power can appear as a result of a re-summed series. Such a series would need to include powers of 
(A^ [P])"' summed over n which corresponds to a sum over splittings with a varying number of partons in the final 
state. Parton densities cannot be formulated in terms of a fixed final state because they include effects from any 
number of collinearly radiated partons summed over the number of such partons. For the processes we can evaluate 
using parton densities fulfilling the DGLAP equation this means that they always have the form 



X 



where X includes any number of coUinear jets 



(2.146) 



Why is 7 is referred to as the anomalous dimension of the parton density? This is best illustrated using a running 
coupling with a finite mass dimension, like the gravitational coupling Gpianck ~ When we attach a 

renormalization constant Z to this coupling we first define a dimensionless running bare coupling g. In n dimensions 
this gives us 

gbare ^ g^^^^^^^ ^bare ^ q^^^^^^ ^ ZgiM^) . (2.147) 

For the dimensionless gravitational coupling we can compute the running 

dgjM^) _ d ( 1 2 \ 

1 dZ 

= g{M){n-2 + rj) with r^ = --^^—^ (2.148) 

Hence, there are two sources of running for the renormalized coupling g{M'^): first, there is the mass dimension of the 
bare coupling n — 2, and secondly there is r], a quantum effect from the coupling renormalization. For obvious reasons 
we call r/ the anomalous dimension of Gpianck- 



This is similar to the running of the parton densities in Mellin space, as shown in Eq.(2.141 1 with 7(to) defined in 



Eq.(2.142 1, so we refer to 7 as an anomalous dimension as well. The entire running of the transformed parton density 
arises from collinear splitting, parameterized by a finite 7. There is only a slight stumbling step in this analogy: 
usually, an anomalous dimension arises through renormalization involving a ultraviolet divergence and the 
renormaUzation scale. In our case we are discussing an infrared divergence and the factorization scale dependence. 

2.3.5 Re-summing coUinear logarithms 

Remembering how we arrive at the DGLAP equation we notice an analogy to the case of ultraviolet divergences and 
the running coupling. We start from universal infrared divergences. Those we describe in terms of splitting functions 
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which we regularize using the plus prescription. The DGLAP equation plays the role of a renormalization group 
equation for example for the running coupling. It links parton densities evaluated at different scales ^p- 
In analogy to the scaling logarithms considered in Section : 



2.2.3 



we now test if we can point to a type of logarithm the 



DGLAP equation re-sums by reorganizing our perturbative series of parton splitting. To identify these re-summed 
logarithms we build a physical model based on collinear splitting but without using the DGLAP equation. We then 
solve it to see the resulting structure of the solutions and compare it to the structure of the DGLAP solutions in 
Eq.( |rT45l l. 



We start from the basic equation defining the physical picture of parton splitting in Eq.(2.96 1. Only taking into 
account gluons in pure Yang-Mills theory it precisely corresponds to the starting point of our discussion leading to the 
DGLAP equation, schematically written as 



(z). 
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What we now need is an exact physical definition of the virtuality variable t describing initial-state splitting. If we 
remember that t — < we can follow Eq.(2.116 1 and introduce a positive transverse momentum variable such 
that 
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From the definition of pT in the Sudakov decomposition Eq.(2.87 i we see that is really the transverse 
three-momentum of of the parton pair after splitting. 



If beyond the single parton radiation discussed in Section [2.3.1| we consider a ladder of successive splittings of one 
gluon into two and we for a moment forget about the actual parton densities we can write the hadronic cross section in 
the collinear limit including the appropriate convolution as 
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The dz in Eq.(2.149 1 we replace by the proper convolution P (T„, evaluated at the momentum fraction x. Because 
the splitting kernel is infrared divergent we cut off the convolution integral at xq. Similarly, the transverse momentum 
integral is bounded by an infrared cutoff /ip and the physical external scale [ip. For the split of the two integrals in 



Eq.(2.151 1 it is crucial that /io is the only scale the final matrix element a\ depends on. 

Identifying [ip with the upper boundary of the transverse momentum integration for collinear splitting is the first 



assumption we make for our model. The recursion in Eq.(2.151 1 we can then apply iteratively 
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The two sets of integrals in this equation we will solve one by one, starting with the integrals. 



The crucial physics assumption in our multiple-splitting model concerns the integration boundaries, in addition to the 
global upper limit jip: as the second assumption the transverse momenta of the splittings should be strongly ordered; 
the first splitting integrated over p^ lis bounded from above by the next external scale p^ 2' which is then bounded by 
3, etc. For the n-fold integration this means 



Ho < Pt,1 < Pt,2 < 



(2.153) 
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The transverse momentum integrals in Eq.(2.152 1 then become 
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Note that just as in Eq.( 2. 144 1 as our third assumption we identify the scale of the strong coupling with the 
transverse momentum scale of the splitting /i^ = p^. These integrals we can solve by switching variables, for 
example in the last integral 
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In the second line we shift the integration variable by a constant. To simplify the notation we throughout keep the 
integration boundaries in terms of \pT.j \ ■ The chain of integrals over p^ j we can solve, provided the integrand does 
not have any additional p^ ■ dependence. This gives us 
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This is the final result for the chain of transverse momentum integrals in Eq.(2.152 1. Again, we see that the strong 
coupling is evaluated at the factorization scale p,p, which means we identify = jip following our third assumption. 



To compute the convolution integrals over the momentum fractions in the same equation 
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we again Mellin transform the equation into moment space 
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We can now sum the production cross sections for n collinear jets and obtain 
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This way we can write the Mellin transform of the (n + 1) particle production rate as the product of the ?i -particle rate 
times a ratio of the strong coupling at two scales 
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This is the same structure as the DGLAP equation's solution in Eq.(2.145 1. It means that we should be able to 
understand the physics of the DGLAP equation using our model calculation of a gluon ladder emission, including the 
generically variable number of collinear jets in the form of pp l^^fJ'~ + X, as shown in Eq.(2.146 1. 



We should remind ourselves of the three assumptions we need to make to arrive at this form. There are two 
assumptions which concern the transverse momenta of the successive radiation: first, the global upper limit on all 
transverse momenta should be the factorization scale fjLp, with a strong ordering in the transverse momenta. This gives 
us a physical picture of the successive splittings as well as a physical interpretation of the factorization scale. Second, 
the strong coupling should be evaluated at the transverse momentum or factorization scale, so all scales are unified, in 
accordance with the derivation of the DGLAP equation. 

Bending the rules of pure Yang-Mills QCD we can come back to the hard process ci as the Drell-Yan process 

qq Z. Each step in n means an additional parton in the final state, so Cn+i is Z production with n collinear partons 



On the left hand side of Eq.( 2. 160 1 we have the sum over any number of additional collinear partons; on the right hand 
side we see fixed order Drell-Yan production without any additional partons, but with an exponentiated correction 
factor. Comparing this to the running parton densities we can draw the analogy that any process computed with a scale 
dependent parton density where the scale dependence is governed by the DGLAP equation includes any number of 
collinear partons. 

The logar ithms which are re-summed by scale dependent parton densities we can also identify. Going back to 
Eq.(2.84i reminds us that we start from the divergent collinear logarithms logp™''/^™" arising from the collinear 
phase space integration. In our model for successive splitting we replace the upper boundary by ^p. The collinear 
logarithm of successive initial state parton splitting diverges for /iq ^ 0, but it gets absorbed into the parton densities 
and determines the structure of the DGLAP equation and its solutions. The upper boundary jip tells us to what extent 
we assume incoming quarks and gluons to be a coupled system of splitting partons and what the maximum 
momentum scale of these splittings is. Transverse momenta pT > generated by hard parton splitting are not 
covered by the DGLAP equation and hence not a feature of the incoming partons anymore. They belong to the hard 



process and have to be consistently simulated, as we will see in Sections 2.5.3 and 2.6 While this scale can be chosen 
freely we have to make sure that it does not become too large, because at some point the collinear approximation 
C ~ constant in Eq.(2.84 1 ceases to hold and with it our entire argument. Only if we do everything correctly, the 
DGLAP equation re-sums logarithms of the maximal transverse momentum size of the incoming gluon. They are 
universal and arise from simple kinematics. 
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renormalization scale /ij? 


factorization scale /i p 


source 


ultraviolet divergence 


coUinear (infrared) divergence 


poles cancelled 


counter terms 


parton densities 




(renormalization) 


(mass factorization) 


summation 


re-sum self energy bubbles 


re-sum parton splittings 


parameter 


running coupling Qs(/if;) 


running parton density fj{x,fiF) 


evolution 


RGB for 


DGLAP equation 


large scales 


decrease of am 


increase of aiot for gluons/sea quarks 


theory background 


renormalizability 


factorization 




proven for gauge theories 


proven all orders for DIS 






proven order-by-order DY... 



Table 2: Comparison of renormalization and factorization scales appearing in LHC cross sections. 



The ordering of the splittings we have to assume is not relevant unless we simulate this splitting, as we will see in the 
next section. For the details of this we have to remember that our argument follows from the leading collinear 
approximation introduced in Section 2.3.1 Therefore, the strong pT ordering can in practice mean angular ordering or 
rapidity ordering as well, just applying a linear transformation. 



2.4 Scales in LHC processes 



Looking back at Sections 2.2 and 2.3 we introduced the factorization and renormalization scales step by step 
completely in parallel: first, computing perturbative higher order contributions to scattering amplitudes we encounter 
ultraviolet and infrared divergences. Both of them we regularize using dimensional regularization with 
n = 4 — 2e < 4 for ultraviolet and n > 4 for infrared divergences, linked by analytic continuation. For both kinds of 
divergences we notice that they are universal, i.e. not process or observable dependent. This allows us to absorb 
ultraviolet and infrared divergences into a re-definition of the strong coupling and the parton density. This nominally 
infinite shift of parameters we refer to as renormalization for example of the strong coupling or as mass factorization 
absorbing infrared divergences into the parton distributions. 

After renormalization as well as after mass factorization we are left with a scale artifact . Scales arise as part of a the 
pole subtraction: together with the pole 1/e we have a choice of finite contributions which we subtract with this pole. 
Logarithms of the renormalization and factorization scales will always be part of these finite terms. Moreover, in both 
cases the re-definition of parameters is not based on fixed order perturbation theory. Instead, it involves summing 
logarithms which otherwise can become large and spoil the convergence of our perturbative series in a^. The only 
special feature of infrared divergences as compared to ultraviolet divergences is that to identify the resummed 
logarithms we have to unify both scales to one. 

The hadronic production cross section for the Drell-Yan process or other LHC production channels, now including 
both scales, reads 

<^toti^J'F,^J■B) = / dxi / dx2 fi{xi,^XF) fj{x2,lJ.F) Crij{xiX2S,as{^\),^.F,tJ'R) ■ (2.161) 

The Drell-Yan process has the particular feature that at leading order cTqq only involves weak couplings, it does not 
include as with its implicit renormalization scale dependence at leading order. Strictly speaking, in Eq.( |2.16l"] l the 
parton densities also depend on the renormalization scale because in their extraction we identify both scales. Carefully 
following their extraction we can separate the two scales if we need to. Lepton pair production and Higgs production 
in weak boson fusion are the two prominent electroweak production processes at the LHC. 



The evolution of all running parameters from one renormalization/factorization scale to another is described either by 
renormalization group equation in terms of a beta function in the case of renormalization and by the DGLAP equation 
in the case of mass factorization. Our renormalization group equation for as is a single equation, but in general they 



2.4 Scales in LHC processes 



93 



are sets of coupled differential equations for all relevant parameters, which again makes them more similar to the 
DGLAP equation. 

There is one formal difference between these two otherwise very similar approaches. The fact that we can absorb 
ultraviolet divergences into process-independent, i.e. universal counter terms is called renormalizability and has been 
proven to all orders for the kind of gauge theories we are dealing with. The universality of infrared splitting kernels 
has not (yet) in general been proven, but on the other hand we have never seen an example where is fails for 
sufficiently inclusive observables like production rates. For a while we thought there might be a problem with 
factorization in supersymmetric theories using the MS scheme, but this issue has been resolved. A summary of the 
properties of the two relevant scales for LHC physics we show in Table [2] 



The way we introduce factorization and renormalization scales clearly labels them as an artifact of perturbation 
theories with divergences. What actually happens if we include all orders in perturbation theory? For example, the 
re-summation of the self energy bubbles simply deals with one class of diagrams which have to be included, either 
order-by-order or rearranged into a re-summation. Once we include all orders in perturbation theory it does not matter 
according to which combination of couplings and logarithms we order it. An LHC production rate will then not 
depend on arbitrarily chosen renormalization or factorization scales /i. 

Practically, in Eq.(2.161 1 we evaluate the renormalized parameters and the parton densities at some scale. This scale 
dependence will only cancel once we include all implicit and explicit appearances of the scales at all orders. Whatever 
scale we choose for the strong coupling or parton densities will eventually be compensated by explicit scale 
logarithms. In the ideal case, these logarithms are small and do not spoil perturbation theory. In a process with one 
distinct external scale, like the Z mass, we know that all scale logarithms should have the form \og{iJ,/mz). This 
logarithm vanishes if we evaluate everything at the 'correct' external energy scale, namely m^. In that sense we can 
think of the running coupling as a proper running observable which depends on the external energy of the process. 
This dependence on the external energy is not a perturbative artifact, because a cross section even to all orders does 
depend on the energy. The problem in particular for LHC analyses is that after analysis cuts every process will have 
more than one external energy scale. 



We can turn around the argument of vanishing scale dependence to all orders in perturbation theory. This gives us an 
estimate of the minimum theoretical error on a rate prediction set by the scale dependence. The appropriate interval of 
what we consider reasonable scale choices depends on the process and the taste of the people doing this analysis. This 
error estimate is not at all conservative; for example the renormalization scale dependence of the Drell-Yan production 
rate or Higgs production in weak boson fusion is zero because as only enters are next-to-leading order At the same 
time we know that the next-to-leading order correction to the Drell-Yan cross section is of the order of 30%, which far 
exceeds the factorization scale dependence. Moreover, the different scaling behavior of a hadronic cross section shown 
in Table [2] implies that for example gluon-induced processes at typical x values around 10^^ show a cancellation of 
the factorization and renormalization scale variation. Estimating theoretical uncertainties from scale dependence 
therefore requires a good understanding of the individual process and the way it is affected by the two scales. 



Guessing the right scale choice for a process is hard, often impossible. For example in Drell-Yan production at 
leading-order there exists only one scale, mz- If we set fi = mz all scale logarithms vanish. In reality, LHC 
observables include several different scales. Some of them appear in the hard process, for example in the production 
of two or three particles with different masses. Others enter through the QCD environment where at the LHC we only 
consider final state jets above a certain minimal transverse momentum. Even others appear though background 
rejection cuts in a specific analysis, for example when we only consider the Drell-Yan background for m^t^t > 1 TeV 
to Kaluza-Klein graviton production. Using likelihood methods does not improve the situation because the phase 
space regions dominated by the signal will introduce specific energy scales which affect the perturbative prediction of 
the backgrounds. This is one of the reasons why an automatic comparison of LHC events with signal or background 
predictions is bound to fail once it requires an estimate of the theoretical uncertainty on the background simulation. 
All that means that in practice there is no way to define a 'correct' scale. On the other hand, there are definitely 
poor scale choices. For example, using 1000 x mz as a typical scale in the Drell-Yan process will if nothing else lead 
to logarithms of the size log 1000 whenever a scale logarithm appears. These logarithms eventually have to be 
cancelled to all orders in perturbation theory, inducing unreasonably large higher order corrections. 
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When describing jet radiation, we usually introduce a phase-space dependent renormalization scale, evaluating the 
strong coupling at the transverse momentum of the radiated jet as{p^ j). This choice gives the best kinematic 
distributions for the additional partons because in Section 2.3.5| we have shown that it re-sums large coUinear 
logarithms. 

The transverse momentum of a final state particle is one of scale choices allowed by factorization; in addition to poor 
scale choices there also exist wrong scale choices, i.e. scale choices violating physical properties we need. 
Factorization or the Kinoshita-Lee-Nauenberg theorem which ensures that soft divergences cancel between real and 
virtual emission diagrams are such properties we should not violate — in QED the same property is called the 
Bloch-Nordsieck cancellation. Imagine picking a factorization scale defined by the partonic initial state, for example 
the partonic center-of-mass energy s = X1X2S. We know that this definition is not unique: for any final state it 
corresponds to the well defined sum of all momenta squared. However, virtual and real gluon emission generate 
different multiplicities in the final state, which means that the two sources of soft divergences only cancel until we 
multiply each of them with numerically different parton densities. Only scales which are uniquely defined in the final 
state can serve as factorization scales. For the Drell-Yan process such a scale could be niz, or the mass of heavy 
new-physics states in their production process. So while there is no such thing as a correct scale choice, there are more 
or less smart choices, and there are definitely very poor choices, which usually lead to an unstable perturbative 
behavior. 
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In LHC phenomenology we are usually less interested in fixed-order perturbation theory than in logarithmically 
enhanced QCD effects. Therefore, we will not deepen our discussion of hadronic rates as shown in Eq.(2.161 1 based 



on fixed-order partonic cross sections convoluted with parton densities obeying the DGLAP equation. In Section 2.3.5 
we have already seen that there exist more functions with the same structure as solutions to the DGLAP equation. In 
this section we will look for other structures which obey the DGLAP equation, leading us to Sudakov form factors and 



the parton shower. In Section 2.5.2 we will discuss some of the physical properties of the parton shower Per se it it 
not clear how jet radiation described by the parton shower and jet radiation described by fixed-order QCD processes 



are linked. In Section 2.5.3 we will discuss ways to combine the two approaches in realistic LHC simulations, 
bringing us very close to contemporary research topics. 



2.5.1 Sudakov form factor 

The splitting kernels Pi^j{z) we introduce as something like splitting probabilities, but we never apply a probabilistic 
approach to parton splitting. The basis of such an interpretation are Sudakov form factors describing the splitting of a 



parton i into any of the partons j based on the factorized form Eq.(2.96 1 



A,(i) = A,(t,to) =exp 




(2.162) 



Before we check that such a function can obey the DGLAP equation we confirm that such exponentials appear in 



probabilistic arguments, similar to our discussion of the central jet veto in Section 1.5.2 Using Poisson statistics for 
something expected to occur p times, the probability of observing it n times is given by 



V{n-p) 



(2.163) 



If the exponent in the Sudakov form factor in Eq.(2.162 1 describes the integrated splitting probability of a parton i this 

means that the Sudakov itself describes a non-splitting probability of the parton i into any final state j. 

Using probabilistic Sudakov factors we can use a Monte Carlo {i.e. a Markov process without a memory of individual 



past steps) to compute a chain of parton splittings as depicted in Figure 10 This will describe a quark or a gluon 
propagating forward in time. Starting from a point (xi, ti) in momentum-virtuality space we step by step move to the 
next splitting point {xj,tj). Following the original discussion t2 is the target virtuality at X2, and for time-like final 
state branching the virtuality is positive tj > in all points j. The Sudakov factor is a function of t, so it gives us the 



2.5 Parton shower 



95 



probability of not seeing any branching between ti and t2 as A(ii)/A(i2) < 1- The appropriate cutoff scale to drops 
out of this ratio. Using a flat random number rt the t2 distribution is implicitly given by the solution to 

^^=rte [0,1] with h>t2>to>0. (2.164) 

1^(12) 

Beyond the absolute cutoff scale to we assume that no resolvable branching occurs. 

In a second step we need to compute the matching energy fraction X2 or the ratio X2/X1 describing the momentum 



fraction which is kept in the splitting at X2- The y integral in the Sudakov factor in Eq.(2.162 1 gives us this probability 
distribution which we can again implicitly solve for X2 using a flat random number rx 



--^ = rx e [0, 1 with xi > X2 > ■ (2.165) 

For splitting kernels with soft divergences at y = or y = 1 we should include a numerical cutoff in the integration 
because the probabilistic Sudakov factor and the parton shower do not involve the regularized splitting kernels. 
Of the four momentum entries of the radiated parton the two equations Eqs.( |2.16"4| and ( |2.165| l give us two. The 
on-shell mass constraint fixes a third, so all we are left is the azimuthal angle distribution. We know from symmetry 
arguments that QCD splitting is insensitive to this angle, so we can generate it randomly between zero and 27r. For 
final state radiation this describes probabilistic branching in a Monte Carlo program, just based on Sudakov form 
factors. 



The same statement for initial state radiation including parton densities we will put on a more solid or mathematical 
footing. The derivative of the Sudakov form factor Eq.(2.162 1 

1 dA,{t) 



Ai{t) dt 



Sir 



(2.166) 



is precisely the second term in df{x, t)/dt for diagonal splitting, as shown in Eq.(2.123 1 



dfi{x,t) 1 



dt 



p, 



dz as 
~z 27r 

Z 271 



dy ^ Pj^iiy) fi {x,t) 



A,{t) dt 



(2.167) 



This relation suggests to consider the derivative of the /,;/Aj instead of the Sudakov factor alone to obtain something 
Hke the DGLAP equation 



d fi{x,t) 



df,ix,t) f,{x,t) d\{t) 



dt Ai{t) A,{t) dt Aj(t)2 

""^ dz as 
~z 27r 

dz as 

Ai{t) t z 27r 




dt 



A,{t) dt 



AM' 



dt 



(2.168) 



In the last step we cancel what corresponds to the plus prescription for diagonal splitting, i.e. we remove the 
regularization of the splitting kernel at z 1. Therefore, we need to modify the upper integration boundary by a 
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small parameter e which can in principle depend on t. The resulting equation is the diagonal DGLAP equation with 
unsubtracted splitting kernels, solved by the ratio of parton densities and Sudakov factors 



^ d fi{x,t) _ d Mx,t) 
^dt A,{t) ~ dlogt A,{t) 



E 



dz a, 
^ p. 

Z 2-K 



(^) 



X 

z 



(2.169) 



We can study the structure of these solutions of the unsubtracted DGLAP equation by integrating //A between 
appropriate points in t 



A,(t) 



A, (to) 
Mx,t) 



dt 



z 27r '^'^> \{t') 



A, (to) 



dt' Aj(t) 



dz as 
~z 27r 



A^,(z)/, -,t' 



A.(t)/,(x,to) + 

Ai{t,to)Mx,to) 



dt^ A,{t) ^ 
t' A,(t')^ 



dz as A 

1 p 

z 2t: 



dt' 



dz a 
~ 2 



(.)/.(^,t') 



(2.170) 



where we choose to such that A(to) = 1 and introduce the notation A(ti, t2) = A(ti, to)/A(t2, to) for the ratio of 
two Sudakov factors in the last line. This formula for the dependence of the parton density /^(a;, t) on x and t has a 
suggestive interpretation: corresponding to Eq.(2.163 i the first term can be interpreted as 'nothing happening to / 
between to and t' because it is weighted by the Sudakov no-branching probability A^. The second term includes the 
ratio of Sudakov factors which just like in Eq.(2.164i means no branching between t' and t. Integrating this factor 
times the splitting probability over t'e[to; t] implies at least one branching between to and t. 



The issue of this interpretation of the Sudakov form factor in conjunction with the parton densities is its numerical 
usability in a probabilistic approach: starting from a parton density somewhere in [x — t) space we need to evolve it to 
a fixed point (a;„, t„) given by the hard subprocess, e.g. qq Z with mz giving the scale and energy fraction of the 
two quarks. Numerically it would be much easier to simulate backwards evolution where we start from the known 
kinematics of the hard process and the corresponding point in the {x — t) plane and evolve towards the partons in the 
proton, ideally to a point where the probabilistic picture of collinear, stable, non-radiating quarks and gluons in the 
proton holds. This means we need to define a probability that a parton evolved backwards from a space-like t2 < to 
ti < with |t2| > |ti| does not radiate or split. 

For this final step we define a probability measure for the backwards evolution of partons n(ti , t2 ; a;) . Just like the 



two terms in Eq.(2. 170 1 it links the splitting probability to a probability of an undisturbed evolution. For example, we 
can write the probability that a parton is generated by a splitting in the interval [t, t + 6t], evaluated at (t2, x), as 
dF{t; t2). The measure corresponding to a Sudakov survival probability is then 



n(ti,t2;x) = 1 - / dF{t-t2) ■ (2.171) 



Comparing the definition of dF to the relevant terms in Eq.(2.170l and replacing t — > t2 and t' twe know what 
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happens for the combination 

Mx,t2)dF{t;t2) 



dt A,{t2) 
t A,{t) 

dt A,(t2) ] 

dtA,{t2) 



dz as - 
z 2tt 



E 



dz as - 
z 2n 



d fiix,t) 
dt A,{t) 



using Eq.(2.169 1 . 



This means 



Il{ti,t2;x) = 1- 



Mx,t)A,{t2) 
f,{x,t2)A,{t) 



Mx,h)A,{t2) 
Mx,t2)A,{h) 



(2.172) 



(2.173) 



and gives us a probability measure for backwards evolution: the probability of evolving back from t2 to ti is described 
by a Markov process with a flat random number as 



/,(x,ti)A,ft2) 
/,(x,t2)A,;(ti) 



= re[Q, 1] 



with |t2| > 



(2.174) 



While we cannot write down this procedure in a closed form, it shows how we can algorithmic ally generate initial state 
as well as final state parton radiation patterns based on the unregularized DGLAP equation and the Sudakov factors 
solving this equation. One remaining issue is that in our derivation of the collinear re-summation interpretation of the 
parton shower we assume some a strong ordering of the radiated partons which we will discuss in the next section. 



2.5.2 Soft gluon emission 

To this point we have built our parton shower on collinear parton splitting or radiation and its universal properties 



indicated by Eq.(2.96 1. Deriving the diagonal splitting kernels in Eqs.(2.101 1 and (2.1 14 1 we encounter an additional 
source of infrared divergences, namely soft gluon emission corresponding to energy fractions z — > 0, 1. Its radiation 
pattern is also universal, just like the collinear case. One way to study this soft divergence without an overlapping 
collinear pole is gluon radiation off a massive quark with momentum q + k and mass to, which could be attached to 
some hard process as a splitting final state. The initial quark momentum q + k splits into a hard quark q and a soft 
gluon k with k^ <^ q^ = m? 

{q + fcj — TO^ 
= gsT'' e* {k) u{q) [-iY + + - ^1] 



2(gfc) + 0(fc2) 



Mr 



sT^ e*{k) uiq) 



0{k) 



{qk)+0{k^) 



Mn with the Dirac equation u{q){^ — m) — 



sT"" <(fc) ^ u{q) Mr. 



(qjk) 



u{q) Mr 



(2.175) 



The conventions are similar to Eq. p.lOl") , Mn includes all additional terms except for the spinor of the outgoing 
quark with momentum q + k. Neglecting the gluon momentum altogether defines the leading term of the 
eikonal approximation. 

In the last step we simply add all possible sources j of gluon radiation. This defines a color operator which we insert 
into the matrix element and which assumes values of +T^j for radiation off a quark, — IJ, for radiation off an 
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antiquark and —ifabc for radiation off a gluon. For a color neutral process like our favorite Drell-Yan process adding 
an additional soft gluon qq — > Zg it returns J^j Tj = 0. 



The matrix element in Eq.(2.175 1 we need to square. It includes a polarization sum and will therefore depend on the 
gauge. We choose the general axial gauge for massless gauge bosons 



2 V 



pol 



(nk) 



(nky 



k^Tln + Tlfj^ki, 



with a light-like reference vector n obeying = 0. The matrix element squared then reads 



(2.176) 



\M 



n+l\ 



{nk) 



{qjk) 



{q,k) 



[qjk) ] {nk) 




\ ^ rpaJljjJ_ 



\Mn\ 



(2.177) 



The insertion operator in the matrix element has the form of an insertion current multiplied by its hermitian conjugate. 
This current describes the universal form of soft gluon radiation off an n-particle process 



with r^^{k,{q,})^Y.^^ 



(qjk) 



(2.178) 



The squared current appearing in the matrix element squared, Eq.(|2?T77]i, we can further simplify to 




{q,k){qjk) 



2 f°-ff 



{q^k){qjk) 



^T.^^n ( TtMx - - A ) in the general massive case 



iqMljk) 2{q,kf 2{q^ky 



= 2 V ffff , , / , , for massless partons 

^ ' {q^k)iq,k) 

In the last step we only bring the eikonal factor into a different form which sometimes comes in handy because it 
separates the two divergences associated with qi and with qj 



At this point we return to massless QCD partons, keeping in mind that the ansatz Eq.(2.175 1 ensures that the insertion 
currents only model soft, not coUinear radiation. Just as a side remark at this stage — our definition of the insertion 
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current in Eq.(2.178 1 we can generalize to colored processes, where the current becomes dependent on the gauge 
vector n to cancel the n dependence of the polarization sum 



Pj ri_ 

(Pjk) (nk) 



(2.180) 



This dipole radiation term in Eqs.(2.178 i and (2.1801 we can study to see how successive soft gluon radiation will be 



organized for example in terms of emission angles. Given that in the interpretation of the DGLAP equation and its 
solutions the ordering of the collinear emissions plays a crucial role this is an important question. As it will turn out, 
the soft emission case will help us understand this feature. 

At this stage, calling the terms in Eq.( |2.180l l a dipole is a little bit of a stretch if we compare it to a multi-pole series. 
To see the actual dipole structure we would need to look at the color structure. We start by symmetrizing the soft 
radiation dipole with respect to the two hard momenta in a particular way 



{q,k){qjk) 



1 — cos 6ij 

(1 - C0S6'ig)(l - COS9jg) 

1 / 1 — cos 6*,,, 



in terms of opening angles 
1 1 



2 \ (1 — cos Oig){l — cos 0jg) 1 — COS ( 

b] 



1 — cos ( 



+ j) 



(2.181) 

Each of the two terms we need to integrate over the gluon's phase space, including the azimuthal angle integration 



1 

2 Jo 
1 
2 



1 — cos I 



1 



1 



(1 — cos 6*^3) (1 — cos 6jg) 1 — cos 6ig 1 — cos 6jg 



1 — cos ( 



Iff 



1 — cos I 



1 — cos 6jg \ 1 — cos Big 



(2.182) 



To disentangle the different angular integrations we express the three parton vectors in polar coordinates where the 
initial parton i propagates into the x direction, the interference partner j in the (x — y) plane, and the soft gluon in the 
full three-dimensional space 



' cos 0jg = {rtjUg) 



n, = (1,0,0) 
fij — (cos 9ij , sin 9ij , 0) 
fig = (cos 9tg , sin Big cos , sin 9ig sin (pig 

cos 9ij cos 9ig + sin 9ij sin 9ig cos (fiig , 



: 1 



hard parton 
interference partner 
soft gluon 

(2.183) 



From the scalar product between these four- vectors we see that of the terms appearing in Eq.(2.182 1 only the opening 
angle 9jg is related to (f>ig, which for the azimuthal angle integration means 



/ 

^0 



hg 



TT 



- cos 9ij _^ 



1 — cos 9ig 2 \ 1 — cos Big 

cos 9in — cos 9 in 

2 



1 — cos 0ig 



T 1 

/ '^'?^«ff ^1 
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1 1 
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(2.184) 



100 



2 QCD 



The azimuthal angle integral in this expression for W-^ we can solve 

/ d(j)iQ = / d(f>in 

7o 1 — cos djg Jq 1 — cos 9ij cos 9ig + sin Oij sin 0ig cos 

/•2- 1 
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Pig Z 2 
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1 1 
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1/z 



with z = e*"^'" , cos (jiig 



z + 1/z 
2 



i ^ 2az — h ~ bz"^ 
2i f dz 



^ b f {z-z^){z- 
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For the entire integral in Eq.( |2.182 1 this gives us 



(2.185) 
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(2.186) 



The soft gluon is only radiated at angles between zero and the opening angle of the initial parton i and its hard 
interference partner or spectator j. The same integral over W^'j' gives the same result, with switched roles of i and j 
After combining the two permutations the probability of radiating a soft gluon at an angle larger than the earlier 
branching is zero; successive emission of soft gluons obeys an angular ordering. 



Going back to collinear radiation, we know from Eq.( |2.83| l and Eq.( |2.150l ) that the different variables describing 
parton splitting, namely the angle 9, the transverse momentum px, or the virtuality y/t, are equivalent as long as we 
are only interested in the leading logarithm arising from the collinear phase space integration. They can be 
transformed into each other by a simple linear transformation. 

Different implementations of the parton shower order their jet radiation by different variables: historically, of the two 
most widely used event generators PYTHIA uses the virtuality or the transverse momentum while HERWIG uses the 
angle. Once we re-sum the collinear logarithms the finite shifts which appear in the transformation between the 
ordering parameters are also re-summed, so the underlying variable in the parton shower does matter in the 
comparison to data. This is even more true for the matching of the parton shower and the matrix element which we 
will discuss in the next section. 

In this section we have learned an argument for using the angle to order parton radiation: soft radiation is ordered in 
the angle 9, and a parton shower would be well advised to match this behavior On the other hand, in Section 2.3.5 we 



have learned that the physical interpretation of the collinear logarithms re-summed by the parton shower is based on a 
transverse momentum ordering. If theory does not define the golden way to order parton radiation we will have to rely 
on Tevatron or LHC measurements to tell us which implementation works best. This includes the new SHERPA 
parton shower which is not based on the collinear splitting kernels but on the soft-collinear QCD dipoles introduced in 
this section. 
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2.5.3 CKKW and MLM schemes 

The main problem with QCD at the LHC is the range of energy scales of the jets we encounter. Collinear jets with 



their small transverse momentum are well described by a parton shower From Section 2.3.5 we know that strictly 
speaking the parton shower only fills the phase space region up to a maximum transverse momentum < iip. In 
contrast, hard jets with large transverse momentum are described by matrix elements which we compute using the 
QCD Feynman rules. They fill the non-collinear part of phase space which is not covered by the parton shower 
Because of the collinear logarithmic enhancement we discussed in Section [2.3.5| we expect many more collinear and 
soft jets than hard jets at the LHC. 

The natural question then becomes: what is the range of 'soft' or 'collinear' and what is 'hard'? Applying a 



consistency condition we can define collinear jet radiation by the validity of the collinear approximation in Eq.( 2.84 1. 
The maximum px of a collinear jet is the upper end of the region for which the jet radiation cross section behaves like 
1 /pt or the point where the distribution prda / dpT leaves its plateau. For harder and harder jets we will at some point 
become limited by the partonic energy available at the LHC, which means the px distribution of additional jets will 
start dropping faster than 1/pt- Collinear logarithms will become numerically irrelevant and jets will be described by 
the regular matrix element squared without any re-summation. 



Quarks and gluons produced in association with gauge bosons at the Tevatron behave like collinear jets for 
Pt 20 GeV, because quarks at the Tevatron are limited in energy. At the LHC, jets produced in association with 
tops behave like collinear jets to p^ ^ 150 GeV, jets produced with 500 GeV gluinos behave like collinear jets to px 
scales larger than 300 GeV. This is not good news, because collinear jets means many jets, and many jets produce 
combinatorial backgrounds and ruin the missing momentum resolution of the detector: if we are looking for example 
for two jets to reconstruct an invariant mass you can simply plot all events as a function of this invariant mass and 
remove the backgrounds by requiring all event to sit around a peak in mjj. If we have for example three jets in the 
event we have to decide which of the three jet-jet combinations should go into this distribution. If this is not possible 
we have to consider two of the three combinations as uncorrelated 'background' events. In other words, we make 
three histogram entries out of each signal or background event and consider all three background events plus two of 
the three signal combinations as background. This way the signal-to-background ratio decreases from Ns /Nb to 
Ns/{3Nb + 2Ns)- A famous victim of such combinatorics was for a long time the Higgs discovery channel 
pp — >■ ttH with H — ^ bb. 



For theorists this means that at the LHC we have to reliably model collinear and hard jets. For simplicity, in this 
section we will first limit our discussion to final state radiation, for example off the i?-ratio process e+e^ ^ qq from 



Section 2.1.1 Combining collinear and hard jets in the final state has to proceed in two steps. The first of them has 
nothing to do with the parton shower: the problem we need to solve is that the parton shower by construction 
generates a definitive number of jets. We can categorize the generated events by counting the number of jets in the 
final state, i.e. the parton shower events are jet-exclusive. On the other hand, the total rate for the hard process we 
compute as e+e~ qq + X, with any number of collinear jets in the final state denoted by X. Predictions involving 
parton densities and the DGLAP equation are jet-inclusive. Any scheme combining the parton shower and hard matrix 
elements has to follow the path 

1. define jet-exclusive events from the hard matrix elements and the parton shower 

2. combine final states with different numbers of final state particles 

3. reproduce matrix element results in high-p^ and well separated phase space region 

4. reproduce parton shower results for collinear and soft radiation 

5. interpolate smoothly and avoid double counting of events 

For specific processes at the Tevatron the third and fourth point on this list have actually been tackled by so-called 
matrix element corrections in the parton shower Monte Carlos PYTHIA and HERWIG. 

For example the final state of the process e+e^ qq + X often involves more than two jets due to final state splitting. 
Even for the first step of defining jet-exclusive predictions from the matrix element we have to briefly consider the 



102 



2 QCD 



geometry of different jets. To separate jet-inclusive event samples into jet-exclusive event samples we have to define 
some kind of jet separation parameter As a start, we radiate a gluon off one of the quark legs, which gives us a qqg 
final state. This additional gluon can either be collinear with and hence geometrically close to one of the quarks or not. 



Jet algorithms which decide if we count such a splitting as one or two jets we describe in detail in Section 3.2.1 They 



are based on some kind of collinearity measure i/ij which involves the distance in R space Eq.(2.35 1 and the 
transverse momenta. We define two jets as collinear and hence as one jet if yij < j/resoi where ?/,esoi we give to the 
algorithm. As a result, the number of jets in an event will of course depend on this resolution parameter ?/resoi- 

For the second step of combining hard and collinear jet simulation the same resolution parameter appears in a form 
where it becomes a collinear vs hard matching parameter Umntch, is. it allows us to clearly assign each hadron collider 
event a number of collinear jets and a number of hard jets. Such an event with its given number of more or less hard 
jets we can then describe either using matrix elements or using a parton shower, where 'describe' means computing the 
relative probability of different phase space configurations. The parton shower will do well for jets with yij < j/match- 
In contrast, if for our closest jets we find yij > j/match, we know that collinear logarithms did not play a major role, so 
we should use the hard matrix element. If we assign the hard process a typical energy or virtuality scale thaid we can 
translate the matching parameter j/match into a virtuality scale tmatch — 2/match^hai-d, below which we do not trust the hard 
matrix element. For example for the Drell-Yan process the hard scale would be something like the Z mass. 

The CKKW jet combination scheme first tackles the problem of defining jet-exclusive final states. While an 
exclusive rate requires a process to have exactly a given number of jets, an inclusive rate is defined as the number of 
events in which we for example identify n jets and ignore everything else appearing in the event. For example, 
additional collinear jets which we usually denote as '+X' will be included. The main ingredient to translating one 
into the other are non-splitting probabilities, i.e. Sudakov factors. They can transform inclusive n-particle rates into 
exact n-particle rates, with no additional final state jet outside a given resolution scale. Analytically we can compute 
integrated splitting probabilities rj (thaidj t) which for quarks and gluons are implicitly defined through the Sudakov 
factors which we introduce in Eq. ( [2.162] ) 



A5(thard, Wch) =exp ^-^^ y J dy '^Pq^qiy)^ = exp J^^ rq(<hard,i)^ 

Ag(ihard,imatch) =exp j'- / Pfl (4aid, t) + F/ (i)] V (2.187) 

For final state radiation t corresponds to the original ^/p^ and, moving forward in time, is ordered according to 
ihard > t > tmatch- The rcsolution of individual jets we identify with the matrix element-shower matching scale t,natch- 
The y integration in the original definition we can carry out in the leading logarithm approximation, giving us 



r,(thard,i) =r,^9(ihard,i)-^ ^ [ ^ log 



Cp as{t) / 1 thard _ 3 

TT t \2 '^^ t 4 



rf{t)^r,^,{t) (2.188) 

The virtualities thard > t correspond to the incoming (mother) and outgoing (daughter) parton. Unfortunately, this 
formula is somewhat understandable from a probabilistic picture of parton splitting, but not quite. Terms arising from 
next-to-leading logarithms spoil the limit imatch 4aid, where the probability of no splitting should approach unity. 
Technically, we can deal with the finite terms in the Sudakov factors by requiring them to be positive semi-definite, i.e. 
by replacing r(4aid, ^match) < by zero. For the general argument this problem with the analytic expressions for the 
splitting functions is irrelevant; to avoid unnecessary approximations in the y integration more recent CKKW 
implementations integrate the splitting kernels numerically. 

To get a first idea how to transform inclusive into exact n-jet rates we compute the probability to see exactly two jets 



in the process e+e qq. Looking at Figure 1 1 this means that none of the two quarks in the final state radiate a 
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Figure 11: Vetoed showers on two-jet and three-jet contributions. The scale at the gauge boson vertex is <haid- The 
two-jet (three-jet) diagram implies exactly two (three) jets at the resolution scale imatch> below which we rely on the 
parton shower. Figure from Ref . [|25l . 



resolved gluon between the virtualities thard (given by the qqZ vertex) and imatch < ihard- As will become important 
later we specify that this no-radiation statement assumes a jet resolution as given by end point of the external quark 
and gluon legs. The probability we have to multiply the inclusive two-jet rate with is then [Ag(thaid, ^match)]^, once for 
each quark. Whatever happens at virtualities below imatch will be governed by the parton shower and does not matter 



anymore. Technically, this requires us to define a vetoed parton shower which we will describe in Section 2.6.3 



Next, what is the probability that the initially two-jet final state evolves exactly into three jets, again following 
Figure 



11 



[ ? We know that it contains a factor Ag(thard, ^match) for one untouched quark. 
After splitting at tq with the probability rg(iq, ihaid) the second quark survives to imatch, giving us a factor 

tmatch)- If wc assign the virtuality tg to the radiated gluon at the splitting point we find the gluon's survival 
probability Ag(tg, imatch)- So together we find 

Ag(thard, imatch) ^ qithmdi tq) ^q{tq , tmatch) Ag (fg , ttnatch) ' ' ' (2.189) 

That's all there is, with the exception of the intermediate quark. If we label the quark virtuality at which the second 
quark radiates a gluon by tq there has to appear another factor describing that the quark, starting from ihaid, gets to tq 
untouched. Naively we would guess that this probability is given by Aq(thard: tq)- However, this implies no splittings 
resolved at the fixed lower scale tq, but what we really mean is no splitting between ihaid and tg resolved at a third 
scale tmatch < tq givcn by the quark leg hitting the parton shower regime. We therefore better compute the probability 
of no splitting between thard and tq, namely Ag(thard, ^match), but under the condition that splittings from tq down to 
imatch are explicitly allowed. 

If no splitting gives us Ag(thard: ^match)^ foi" onc splitting from tq we add a factor T{tq, t) with an unknown splitting 
point t. This point t we integrate over between the resolution point tmatch and the original splitting point tq. This is the 



same argument as in our physical interpretation of the Sudakov factors solving the DGLAP equation Eq.(2.170i. For 
an arbitrary number of possible splittings between tq and i,natch we find the sum 



Ag(thard5 ^match) 



1 



dt Tq{tq, t) + more splittings 



A q ( t hard 1 ^ match ) 6Xp 



dtTq{tq,t) 



^g(^hard) ^match) 
(j^q 1 ^match) 



(2.190) 



The factors 1/n! in the Taylor series appear because for example radiating two jets in the same t interval can proceed 
ordered in two ways, both of which lead to the same final state. Once again: the probability of nothing happening 
between ^hani and tq we compute from the probability of nothing happening between fhaid and imatch times any number 
of possible splittings between tq and tmatch- 



Collecting all factors from Eq.( |2.189| ) and Eq.( |2.190) l gives us the probability to find exactly three partons resolved at 

Ag(ihard5 ^match) 

(2.191) 



^match as part of the inclusive sample 

Aq (thai-d; ^match) ^ q(t\\axditq) ^q{tq,tjj^^xch) Ag (tg , ^jj^atch) 



^q(tq: tmatch) 



— ^qithardjtq) [Ag (thardj ^match)] Ag (tg , thatch) • 
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This result is what we expect: both quarks go through untouched, just like in the two-parton case. In addition, we 
need exactly one splitting producing a gluon, and this gluon cannot split further. This example illustrates how we can 
compute these probabilities using Sudakov factors: adding a gluon corresponds to adding a splitting probability times 
the survival probability for this gluon, everything else magically drops out. At the end, we only integrate over the 
splitting point tq. 

This discussion allows us to write down the first step of the CKKW algorithm, combining different hard n-jet 
channels into one consistent set of events. One by one we turn inclusive n-jet events into exact n-jet events. We can 
write down the slightly simplified algorithm for final state radiation. As a starting point, we compute all leading-order 
cross sections for n-jet production with a lower jet radiation cutoff at tmatch- This cutoff ensures that all jets are hard 
and that all corresponding cross sections an.i are finite. The second index i describes different non-interfering parton 
configurations for a given number of final state jets, like qqgg and qqqq for n — A. The purpose of the algorithm is to 
assign a weight (probability, matrix element squared,...) to a given phase space point, statistically picking the correct 
process and combining them properly. It proceeds event by event: 

1. for each jet final state (n, i) compute the relative probabiUty P„^i = CT„,i/ X)fe j '^kj 

2. select a final state (n, i) with its probability P„ ^ 

3. assign the momenta from the phase space generator to, assumed, hard external particles 

4. compute the transition matrix element \Ai\^ including parton shower below imatch 

5. use a jet algorithm to compute the shower history, i.e. all splitting virtualities tj in each event. 

6. check that this history corresponds to possible Feynman diagrams and does not violate any symmetries 

7. for each internal and external line compute the Sudakov non-splitting probability down to thatch 

8. re-weight the values of each splitting using the fcy scale from the shower history 

9. combine matrix element, Sudakovs, and into a final weight 

This final event weight we can use to compute distributions from weighted events or to decide if to keep or discard an 
event when producing unweighted events. The construction ensures that the relative weight of the different n-jet rates 
is identical to the probabilities we initially computed. In step 4 the CKKW event generation first chooses the 
appropriate hard scale in the event; in step 5 we compute the individual starting scale for the parton shower applied to 
each of the legs. Following our example, this might be thaid for partons leaving the hard process itself or tg for a 
parton appearing via later splitting. 

In a second step of the CKKW scheme we match this combined hard matrix element with the parton shower, given the 
matching point imatch- From the final experimental resolution scale tresoi up to a matching scale ^match we rely on the 
parton shower to describe jet radiation while above the matching scale jet radiation is explicitly forbidden by the 
Sudakov non-splitting probabilities. Individually, both regimes consistently combine different n-jet processes. All we 
need to make sure is that there is no double counting. 

From the discussion of Eq.(|ZT90]l we know that Sudakovs describing the evolution between two scales and using a 
third scale as the resolution are going to be the problem. Carefully distinguishing the scale of the actual splitting from 
the scale of jet resolution is the key. The CKKW scheme starts each parton shower at the point where the parton first 
appears, and it turns out that we can use this argument to keep the resolution regimes y > ymatch and y < j/match 
separate. There is a simple way to check this, namely if the j/match dependence drops out of the final combined 
probabilities. The answer for final state radiation is yes, as proven in the original paper, including a hypothetical 
next-to-leading logarithm parton shower. The CKKW scheme is implemented in the publicly available SHERPA and 
MadEvent event generators. 

A widely used variant of the CKKW scheme is the MLM scheme , for example implemented in ALPGEN or 
MadEvent. Its main difference to the CKKW scheme is that it avoids computing the survival properties using Sudakov 
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Figure 12: Number of additional jets with a transverse momentum of at least 30, 50 or 100 GeV radiated off top pair 
production and the production of heavy states at the LHC. An example for such heavy states are scalar gluons with a 
mass of 300 or 600 GeV, pair-produced in gluon fusion. Figures from Ref. 1261 . 



form factors. Instead, it attempts to veto those events which CKKW removes by applying the Sudakov non-splitting 
probabilities. This way it avoids problems with splitting probabilities beyond the leading logarithms, for example the 
finite terms appearing in Eq.( |2.188] l which can otherwise lead to a mismatch between the actual shower evolution and 
the analytic expressions of the Sudakov factors. In addition, the veto approach allows the MLM scheme to combine a 
set of n-parton events after they have been generated using hard matrix elements. 

In the MLM scheme we veto events which are simulated the wrong way after the hard matrix element and the parton 
shower have defined a set of complete events. This avoids double counting of events which on the one hand are 
generated with n hard jets from the matrix element and on the other hand appear for example as (n — 1) hard jets with 
an additional jet from the parton shower. After applying a jet algorithm (which in the case of ALPGEN is a cone 
algorithm and in case of MadEvent is a fc-r algorithm) we compare the showered event with the un-showered hard 
event by identifying each reconstructed showered jet with the partons we started from. If all jet-parton combinations 
match and there exist no additional resolved jets we know that the showering has not altered the hard structure of the 
event. This corresponds to adding the Sudakov non-splitting probabilities in the CKKW scheme. If there is a 
significant change between the original hard parton event and the showered event this event has to go. The only 
exception to this rule is the set of events with the highest jet multiplicity for which additional jets can only come from 
the parton shower. After defining the proper exclusive n-jet event sets we can again use the parton shower to describe 
more coUinear jet radiation between thatch and iresoi- 

Unfortunately, the vetoing approach causes a slight problem for the MLM scheme. We still need a backwards 
evolution of a generated event to know the virtuality scales which fix as{Q^)- As a side effect, if we also know the 
Feynman diagrams describing an event we can check that a certain splitting with its color structure is actually 
possible. For this parton shower simulation we need to know the interval of virtualities over which for example the 
additional gluon in the previous two-jet example can split. The lower end of this interval is universally given by tmatch. 
but the upper end we cannot extract from the event record. Therefore, to compute the as values at each splitting point 
we start the parton shower at an universal hard scale, chosen as the hard(est) scale of the process. 
Aside from such technical details all merging schemes are conceptually similar enough that we should expect them to 
reproduce each others' results, and they largely do. But the devil is in the details, so experiment will tell which 
scheme as part of which event generator produces the most usable results to understand LHC data. 



To summarize, we can use the CKKW and MLM schemes to first combine n-jet events with variable n and then 
consistently add the parton shower. In other words, we can for example simulate Z + n jets production at the LHC to 
arbitrarily large numbers of jets, limited only by computational resources and the physical argument that at some point 
any additional jet radiation will be described by the parton shower. This combination will describe all jets correctly 
over the entire coUinear and hard phase space. In Figure 12 we show the number of jets expected to be produced in 
association with a pair of top quarks and a pair of heavy new states at the LHC. The details of these heavy scalar 
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gluons are secondary for the basic features of these distributions, the only parameter which matters is their mass, i.e. 
the hard scale of the process which sets the factorization scale and defines the upper limit of collinearly enhanced 
initial-state radiation. We see that heavy states come with many jets radiated at ^ 30 Ge V, where most of these jets 
vanish once we require transverse momenta of at least 100 GeV. This figure tells us that an analysis which asks for a 
reconstruction of two W decay jets may well be swamped by combinatorics. 



Looking at the individual columns in Figure 12 there is one thing we have to keep in mind: each of the merged matrix 
elements combined into this sample is computed at leading order The emission of real particles is included, virtual 
corrections are not. In other words, the CKKW and MLM schemes give us all jet distributions but leave the 
normalization free, just like an old-fashioned Monte Carlo. Curiously, evaluating total rates we find that merging 
schemes usually give an improved prediction of total rates compared to fixed order calculations, but it is not clear how 
general this feature is and how we can quantify it. Systematically curing this shortcoming will be the topic of the next 
section. 

Before moving on to an alternative scheme we will illustrate why Higgs or exotics searches at the LHC really care 
about progress in QCD simulations: one way to look for heavy particles decaying into jets, leptons and missing energy 
is the variable 

TOeff = IjT + J + Et,1 

j f 

= pT + ^ VT,j + ^ Pt/ (for massless quarks, leptons) (2.192) 



This variable and its relatives we will discuss in detail in Section 3.4.2 For gluon-induced QCD processes the effective 
mass should be small while the new physics signal's effective mass scale will be determined by the heavy masses. 
For QCD jets as well as for W and Z plus jets backgrounds we can study the production of many jets using the 



CKKW scheme. Figure 13 shows the two critical distributions. First, in the number of hard jets we see the so-called 



staircase scaling behavior, namely constant ratios of exclusive (n + l)-jet and n-jet rates (T„+i/(t„. Such a scaling is 



closely related to the pattern we discuss in Eq. (1.134 1, in the context of the central jet veto of Section 1.5.2 The 



particularly interesting aspect of staircase scaling is that the constant ratio is the same for jet-inclusive and 



jet-exclusive cross sections Pinci — Pexc\, as shown in Eq.( 1.135 i 



The consistent variation of Us gives a small parametric uncertainty on these rates. A common scaling factor /i//io for 



all factorization, renormalization and shower scales in the process following our argument of Section 2.4 is strictly 
speaking not fixed by our physical interpretation in terms of resummation; such a factor as part of the leading 
logarithm can be factored out as a subleading finite term, so it should really be considered a tuning parameters for 
each simulation tool. Using the same simulation we also show the effective mass and observe a drop towards large 
values of mgff. However, this drop is nowhere as pronounced as in parton shower predictions. This analysis shows that 
the naive parton shower is not a good description of QCD background processes to the production of heavy particles. 
Using a very pragmatic approach and tune the parton shower to correctly describe LHC data even in this parameter 
region will most likely violate basic concepts like factorization, so we would be well advised to use merging schemes 
like CKKW or MLM for such predictions. 

2.6 Next-to-leading orders and parton shower 



As we know for example for the R ratio from Section p.l.l| the precision of a leading order QCD calculation in terms 
of the strong coupling constant Ug is not always sufficient to match the experimental accuracy. In such a case we need 



to compute observables to higher order in QCD. On the other hand, in Section 2.3.5 we have seen that the parton 
shower does not respect a fixed order perturbation theory. With its coUinear logarithm it sums particular terms to all 
orders in a^. So how can we on the one hand compute higher order corrections to for example the Drell-Yan cross 
section and distributions and in addition consistently combine them with the parton shower? 

Such a combination would remove one of the historic shortcomings of parton shower Monte Carlos. Apart from the 
coUinear approximation for jet radiation they are limited by the fact that they 'only do shapes'. In other words, the 
normalization of the simulated event sample will always be leading order in perturbative QCD and hence object to 
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Figure 13: Exclusive number of jets and effective mass distributions for pure QCD jet events at the LHC with a center- 
of-mass energy of 7 TeV and ptj > 50 GeV. The curves including the uncertainty and a scale variation (tuning 
parameter) are computed with SHERPA and a fully merged sample including up to six hard jets. These distributions 
describe typical backgrounds for searches for jets plus missing energy with fake missing energy, which could originate 
from supersymmetric squark and gluino production. Figures from Ref. STD . 



large theoretical uncertainties. The reason for this shortcoming is that coUinear jet radiation relies on a hard process 
and the corresponding production cross section and works with splitting probabilities, but never touches the total cross 
section it started from. 

As a solution we compute higher order cross sections to normalize the total cross section entering the respective 
Monte Carlo simulation. This is what we call a K factor : K = cr™P™^'=Vc^'^'^ = cr™P'"°^<=7crL0 crucial to 
remember that higher order cross sections integrate over unobserved additional jets in the final state. So when we 
normalize the Monte Carlo we assume that we can first integrate over additional jets and obtain (j'™p"™'^ and then just 
normalize the Monte Carlo which puts back these jets in the collinear approximation. Obviously, we should try to do 
better than that, and there are two ways to improve this traditional Monte Carlo approach, the MC@NLO scheme and 
the POWHEG scheme. 



2.6.1 Next-to-leading order in QCD 

When we compute the next-to-leading order correction to a cross section, for example to Drell-Yan production, we 
consider all contributions of the order Gpcts- There are three obvious sets of Feynman diagrams we have to square 
and multiply, namely the Born contribution qq — > Z, the virtual gluon exchange for example between the incoming 
quarks, and the real gluon emission qq Zg. An additional set of diagrams we should not forget are the crossed 
channels qg Zq and qg Zq. Only amplitudes with the same external particles can be squared, so we find the 
matrix-element-squared contributions 

\Mb\^'^Gf 

2Re A^t-Xs cx Gpa, |A^zg|'oc Gf^s \Mzq\MMzq\^ Gpas . (2.193) 



Strictly speaking, we have to include counter terms, which following Eq.(2.54i are a modification of |A^Bp. These 
counter terms we add to the interference of Born and virtual gluon diagrams to remove the ultraviolet divergences. 
However, this is not the issue we want to discuss. 



Infrared poles arise from two sources, soft and colUnear divergences. To avoid the complication of overlapping 
collinear and soft divergences we will follow a toy model by Bryan Webber It describes simplified particle radiation 
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off a hard process: the energy of the system before radiation is Xg and the energy of the outgoing particle (call it 
photon or gluon) is x, so x < Xg < 1- When we compute next-to-leading order corrections to a hard process, the 
different contributions, neglecting crossed channels, are 



da 
dx 



BS{x) 



da 

dx 



-^^+V]S{x) 



da 
dx 



R{x) 



(2.194) 



The constant B describes the Bom process and the factorizing poles in the virtual contribution. The coupling constant 
as ignores factors 2 and tt or color factors. We immediately see that the integral over x in the real emission rate is 
logarithmically divergent in the soft limit, similar to the collinear divergences we know. From factorization which we 
have illustrated based on the universality of the leading splitting kernels we know that in the collinear and soft limits 
the real emission has to follow the Born matrix element 



lim R{x) = B 



(2.195) 



An observable computed beyond leading order includes contributions from real gluon emission and virtual gluon 
exchange. If the observable is infrared safe it will have a smooth limit towards vanishing gluon energy 0{x) — > 0(0). 
The virtual corrections alone diverge, but the expectation value including virtual and real gluon contributions after 
dimensional regularization is finite. Because we are interested in infrared divergences we choose n = 4 + 2e 
dimensions with e > 0, just like in Sectionp.3.3| and schematically obtain the two divergent contributions 



(0) 



dx 



0{x) 0(0) 



rl-2€ 



2e 



(2.196) 



This kind of combination has a finite limit for e 
need to implement this cancellation differently. 



0. However, for numerical applications and event simulation we 



The expectation value of any infrared safe observable over the entire phase space, including Born terms, virtual 
corrections and real emission, is given by 



(O) = {0)B + {0)v + {0)n = fip 



2e 



Jo X 


da 




da 




da 




dx 


B 


dx 


V 


dx 


R 



(2.197) 



The same way the renormalization and factorization scales appear, dimensional regularization now yields an 
additional factor l/x^^*^. Because we know its structure, we will omit this factorization scale factor in the following. 



When we compute for example a distribution of the energy of one of the heavy particles in the process, we can extract 



a histogram from of the integral for (O) in Eq.(2.197 i and obtain a normalized distribution. The problem is that we 



have to numerically integrate over x, and the individual parts of the integrand in Eq.(2. 197 i are not integrable. 



There exist two methods to combine the virtual and real contributions to an observable and produce a finite physics 
result. The first way historically introduced by the Dutch loop school for example to compute QCD corrections to top 
pair production is the numerically highly efficient phase space slicing: we divide the divergent phase space integral 
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into a finite part and a pole, by introducing a small parameter A, which acts like 

0{x) da 



{0)r + {0)v 



Q X ^'^ dx 



{Oh 
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dx as J-^ +{0)v 
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=a^i?(0) 0(0) / dx + a 
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--asBO{0) 



, R{x)Oix) 
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X 



BO{0) 2e log A + 0(62) 

=as ^ h a, 

^ e ./A 



^asBO{0) logA + a^ [ dx 
J A 



R{x)0{x) 



asVO{Q)+0{e) 



using Eq.( |2.194 i 

(2.198) 



The two sources of log A dependence have to cancel in the final expression, so we can evaluate the integral at finite 
but small values of A. An amusing numerical trick is to re-write the explicit log A contribution into a NLO-type phase 
space integral. If the eikonal approximation is given in terms of a Mandelstam variable (^(54) and the cut-off has mass 
dimension two we can write 



, A 



dsi log 5{si) = 

M Jo 



dsi 



„max 
gmax _ ^ 



1 

S4 



(2.199) 



and similarly for log A. This representation we can integrate along with the real emission phase space. The result 
will be a finite value for the next-to-leading order rate in the limit A and exactly e = 0. 



The fundamental shortcoming of phase space slicing is that it requires an analytical integration to produce sensible 
observable distributions. To avoid cancellations between integrals and replace them by cancellations among 
integrands we use a subtraction method to define integrable functions under the x integral in Eq. p.l97| ). While our 
toy model appears more similar to the Frixione-Kunszt-Signer subtraction scheme than to the Catani-Seymour 
scheme, both of them really are equivalent at the level of the soft-collinear toy model. Starting from the individually 
divergent virtual and real contributions we first subtract and then add again a smartly chosen term, in this toy model 



identical to a plus-subtraction following Eq.(2. 124 1 
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using Eq.(2.194i . 



{Oh 



(2.200) 



In the subtracted real emission integral we take the limit e — > because the asymptotic behavior of R{x 0) 
regularizes this integral without any dimensional regularization required. The first integral precisely cancels the 
divergence from the virtual correction. We end up with a perfectly finite x integral for the sum of all three 
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contributions, even in the limit e = 0, i.e. there is no numerically small parameter in the expression 



(O) = {0)b + {0)v + {0)r = B 0(0) + asV 0(0) + a, / dx 

Jo 

r-l 

dx 



Rix) O{x)-BO{0) 



0(0) [B + a,V-as-)+ 0(a;) a, ^ 
X J x 



(2.201) 



This subtraction procedure is a standard method to compute next-to-leading order corrections involving one-loop 
virtual contributions and the emission of one additional parton. 

As a side remark, we can numerically improve this expression using a distribution relation 



\x 



„l-2c 



^.M^%f^+/(0) fd..— 

^ "'0 

^ dx M^;~"V-^c --^;m"; /-I , o,i„„^ , ^o^,2 

^ 

1 



(l + 2elogx + O(e2))+/(0) 



2e 



, /(.)-. (..-.)/(o) ^ m-oix.^x)m ^^^^ ^ ^ 

x a; 2e 



2e 



logx 



(2.202) 



where the last line is a relation between appropriately defined distributions. This c-subtraction first introduced as part 
of the Frixione-Kunszt-Signer subtraction scheme is defined as 



dx f{x) g{x)c= / dx {J{x)g{x) ~ J{Q)g{x)e{xc - x)) 

"'0 



(2.203) 



and is a generalization of the plus subtraction defined in Eq.(2.124 1 which we reproduce choosing Xc ~ 1. Linking the 
delta distribution to the divergent integral over l/x it is also reminiscent of the principal value integration, but for an 
endpoint singularity and a dimensionally regularized phase space. Effectively combining phase space subtraction 



Eq.(2.200i and phase space slicing Eq.(2.198 i, we include a cutoff in the integrals holding the subtraction terms 

R(x)0{x) 



{0)r =a. 



dx 



la ^ 

/ dx 
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l-2e 

R{x)Oix) - e{x, ~ x)B 0(0) 
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(2.204) 



The dependence on the finite cutoff parameter Xc drops out of the final result. The numerical behavior, however. 



should be improved if we subtract the infrared divergence only close to the actual pole where following Eq.(2.195 i we 
understand the behavior of the real emission amplitude. 



The formula Eq.( 2.201 1 is, in fact, a little tricky: usually, the Born-type kinematics would come with an explicit factor 
S{x), which in this special case we can omit because of the integration boundaries. We can re- write the same formula 
in a more appropriate way to compute distributions, possibly including experimental cuts 



da 
dO 



dx 



I{0)lo [B + a,V-a,-)+ /(O)nlo a. — 

X J X 



(2.205) 



The transfer function /(O) is defined in a way that formally does precisely what we require: at leading order we 
evaluate /(O) using the Bom kinematics x — while for the real emission kinematics it allows for general a; = • • • 1. 



2.6 Next-to-leading orders and parton shower 



111 



2.6.2 MC @ NLO method 

For example in Eq.( |2.197] l we integrate over the entire phase space of the additional parton. For a hard additional 
parton or jet the cross section looks well defined and finite, provided we fully combine real and virtual corrections. An 
infrared divergence appears after integrating over small but finite a; — > from real emission, and we cancel it with an 
infrared divergence in the virtual corrections proportional to a Born-type momentum configuration S{x). In terms of a 
histogram in x we encounter the real emission divergence at small x, and this divergence is cancelled by a negative 
delta distribution at a; = 0. Obviously, this will only give a well behaved distribution after integrating over at least 
small X. 

This soft and collinear subtraction scheme for next-to-leading order calculations leads us to the first method of 
combining or matching next-to-leading order calculations with a parton shower Instead of the contribution from the 
virtual corrections contributing at 5{x) what we would rather want is a smeared virtual corrections pole which 
coincides with the justified collinear approximation and cancels the real emission over the entire low-a; range. This 
contribution we can view as events with a negative weight, i.e. counter-events. Negative events trigger negative 
reactions with experimentalists, but there is really no problem with them as long as any physical prediction we make 
after adding all leading order and next-to-leading order contributions gives a positive cross section. 
Because we know they describe collinear jet radiation correctly such a modification will make use of Sudakov factors. 
We can write them as a function of the energy fraction z and find dV = asP{z) / z dz. Note that we avoid the 
complicated proper two-dimensional description of Eq.( |2.162] i in favor of the simpler picture just in terms of particle 
energy fractions as introduced in the last section. 

Once we integrate over the entire phase space this modified subtraction scheme has to give the same result as the 
next-to-leading oder rate. Smearing the integrated soft-collinear subtraction term using the splitting probabilities 
entering the parton shower means that the MC@NLO subtraction scheme has to be adjusted to the parton shower we 
use. 



Let us consider the perturbatively critical but otherwise perfectly fine observable, the radiated photon spectrum as a 
function of the (external) energy scale z. We know what this spectrum looks like for the collinear and hard kinematic 
configurations 
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dz 



BP{z) 



LO 
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R{z) 



NLO 



(2.206) 



The first term describes parton shower radiation from the Born diagram at order a^, while the second term is the hard 
real emission defined in Eq.( 2. 194 1. The transfer functions we would have to include in the general form of Eq.( 2.205 1 
to arrive at this equation are 



I{z, xm) 



P{z) 



LO 



P(z) 
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(2.207) 



The second term in the real radiation transfer function arises because at the next order in perturbation theory the 
parton shower also acts on the real emission process. It requires that enough energy to radiate a photon with an energy 
z be available, where xm is the energy available at the respective stage of showering, i.e. z < xm- 
These transfer functions we can include in Eq.( 2.205 i 
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All Bom terms proportional to 5{z) vanish because their contributions would be unphysical. This already fulfills the 
first requirement for our scheme, without having done anything except for including a transfer function. Now, we can 
integrate over z and calculate the total cross section (Ttot with a cutoff Zjnin for consistency. However, Eq.( 2.208 i 
includes an additional term which spoils the result: the same kind of jet radiation is included twice, once through the 
matrix element and once through the shower This is precisely the double counting which we avoid in the CKKW 
scheme. So we are still missing something. 



We also knew we would fall short, because our strategy includes a smeared virtual subtraction term which for finite x 
should cancel the real emission. This subtraction is not yet included. Factorization tells us how to write such a 
subtraction term using the splitting function P as defined in Eq.( |2.206 1 to turn the real emission term into a finite 
contribution 

Rixl ^ Rjx) - BPjx) ^^^^^^ 



Because this is an ad hoc subtraction term we also have to add it to the Born-type contribution. This leads us to a 



modified version of Eq.( 2.205 i, now written for general observables 



da 
dO 



dx 



I{0,1) [B + asV 
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Looking back at different methods of removing ultraviolet divergences this modification from the minimal soft and 



collinear subtraction in Eq.( 2.205 1 to a physical subtraction term corresponding to the known radiation pattern 
reminds us of different renormalization schemes. The minimal MS scheme will always guarantee finite results, but for 
example the on-shell scheme with its additional finite terms has at least to a certain degree beneficial properties when 
it comes to understanding its physical meaning. This is the same for the MC@NLO method: we replace the minimal 
subtraction terms by physically motivated non-minimal subtraction terms such that the radiation pattern of the 
additional parton is described correctly. 



So when we use this form to compute the z spectrum to order as it will in addition to Eq.( 2.208 1 include an integrated 
subtraction term contributing to the Bom-type kinematics 



da 

dz 



dx 



dx a, 



BP{z) ^. . fR{x) BP{x) 
as — + as S{x - z) — 

Z \ X X 



BP{z) + R{z) - BP{z) 



as^+0{al) 



0{al) 



0{al 
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This is exactly the distribution we expect. 



Following the above argument the subtraction scheme implemented in the MC@NLO Monte Carlo describes hard 
emission just like a next-to-leading order calculation. This includes the next-to-leading order normalization of the rate 
as well as the next-to-leading order distributions for the particles produced in the hard process. For example for 
M^4-jets production such corrections to the W and leading jet distributions matter, while for the production of heavy 
new particles their distributions hardly change at next-to-leading order The distribution of the first radiated parton is 
included at leading order, as we see in Eq.( |2.21l"] i. Finally, additional collinear particle emissions is simulated using 
Sudakov factors, precisely Uke a parton shower. 

Most importantly, it avoids double counting between the first hard emission and the collinear jets, which means it 
describes the entire pT range of jet emission for the first and hardest radiated jet consistently. Those additional jets, 
which do not feature in the next-to-leading order calculation, are added through the parton shower, i.e. in the collinear 
approximation. As usually, what looked to easy in our toy example is much harder in QCD reality, but the setup is the 
same. 



2.6 Next-to-leading orders and parton shower 
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2.6.3 POWHEG method 



As described in Section p.6.2| the MC@NLO matching scheme for a next-to-leading order correction and the parton 
shower is based on an extended subtraction scheme. It starts from a given parton shower and avoids double counting 
by modifying the next-to-leading corrections. An interesting question is: can we also combine a next-to-leading order 
calculation by keeping the next-to-leading order structure and apply a modified parton shower? The main ingredient to 



this structure are Sudakov factors introduced in Section 2.5.1 and used for the CKKW merging scheme in 
Section 12331 

In contrast to the MC@NLO scheme the POWHEG (Positive Weight Hardest Emission Generator) scheme does not 
introduce counter events or subtraction terms. It considers the next-to-leading order calculation of a cross section a 
combination of an m-particle and an (to + l)-particle process and attempts to adjust the parton shower attached to 
each of these two contributions such that there is no double counting. 



Our starting point is the next-to-leading order computation of a cross section following Eg. p. 194] ). We can combine it 
with appropriate soft and collinear subtraction terms C in the factorized (to + 1) -particle phase space where for 
simplicity we assume that the integrated subtraction terms exactly cancel the divergences from the virtual corrections: 
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after soft-collinear subtraction 
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(2.212) 



The (to + 1) -particle phase we factorize into the TO-particle phase space and the azimuthally symmetric remainder 
dzdt', as suggested by Eq.(2.96 1. With these two proper integration variables we cannot assume a toy form R/x for 
the real emission, so we denote the real emission as R alone. The projector P maps the nominal (to + l)-particle 
phase space of the real emission onto the TO-particle phase space of the leading order process. 



The first term in Eq.(2.212i is suppressed by one power of ag, so we can add a parton shower to it without any worry. 



The second term consists of the Born contribution and the hard emission of one parton, so we have to avoid double 



counting when defining the appropriate Sudakov factors. Moreover, a serious problem appears in Eq.(2.212 1 when we 
interpret it probabilistically: nothing forces the combination of virtual and subtracted real emission in the first brackets 
to be positive. To cure this shortcoming we can instead combine all TO-particle contributions into one term 
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(2.213) 



where the combination B can only become negative if the regularized next-to-leading contribution over-compensates 
the Born term which would indicate a breakdown of perturbation theory. The second term in the brackets needs to 
project the real emission onto the hard (m + l)-particle phase space. If we replace the symbolic projection (1 — P) by 
a step function in terms of the transverse momentum of the radiated parton prit' , z) we can ensure that it really only 
appears for hard radiation above p™" and at the same time keep the integral over the radiation phase space finite. 



From CKKW jet merging we know what we have to do to combine an TO-particle process with an (to + l)-particle 
process, even in the presence of the parton shower: the TO-particle process has to be exclusive, which means we need 



to attach a Sudakov factor A to veto additional jet radiation to the first term in the brackets of Eq.(2.213 i. In the 
CKKW scheme the factor in the front of the brackets would be B and not B. The introduction of B is nothing but a 
re-weighting factor for the events contributing to the TO-particle configuration which we need to maintain the 
next-to-leading order normalization of the combined TO-particle and (to + l)-particle rates. The second factor agR/B 
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is essentially the multiplicative PYTHIA or HERWIG matrix element correction used for an improved simulation for 
example of M^+jet events. 

The appropriate Sudakov factor for the real emission has to veto only hard jet radiation from an additional parton 
shower This way we ensure that for the (m + l)-particle contribution the hardest jet radiation is given by the matrix 
element R, i.e. no splitting occurs in the hard regime pT > p™". Such a vetoed shower we can define in analogy to 
the (diagonal) Sudakov survival probability Eq.(2.162 1 by adding a step function which limits the unwanted splittings 

to PT > Pt'" 



A(i,pr)=exp 



dz '^P{z)9{pT{t\z)-pr) 



(2.214) 



omitting the resolution to in the argument. This modified Sudakov factor indicates that in contrast to the MC@NLO 
method we now modify the structure of the parton shower which we combine with the higher order matrix elements. 
For the vetoed Sudakov factors to make sense we need to show that they obey a DGLAP equation like Eq.(2.170 1, 
including the veto condition in the splitting kernel 



fix,t) = A{t,p'^"')fix,to)+ /^^ A(i,i',pr) r P{z)e{pT{t',z)~p^'")f(^,t') , (2.215) 

Jtg t Jq Z ZTT \Z / 

where we again show the diagonal case to simplify the notation. The proof of this formula starts from Eq.( |2.170l i with 
the modification of an explicit veto. Using 1 — 9{g) + (1 — 6{g)) we find Eq.(2.215 1 more or less straight away. The 
bottom line is that we can consistently write down vetoed Sudakov probabilities and build a parton shower out of them. 



^ dz a. 



Inserting both Sudakov factors into Eq. p.213[ ) gives us for the combined next-to-leading order exclusive contributions 

(2.216) 
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The first Sudakov factor is not vetoed which means it is evaluated at p™" 



0. 



Based on the next-to-leading order normalization of the integrated form of Eq.(2.216 1 we can determine the form of 
the splitting probability entering the Sudakov factor from the perturbative series: the term in brackets integrated over 
the entire phase space has to give unity. Starting from Eq.( 2.215| ) we first compute the derivative of the Sudakov factor 
with respect to one of its integration boundaries, just like in Eq.(|2.166 1 
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Using this relation we indeed find for the integral over the second term in the brackets of Eq.( |2.216l ) 



dt'dzA{t',p^" 



asR 



(PT(<',^)-pr) 



dt 



df 



, dA(t',p^" 
~dP 

, dA{t',pf' 



Jdz- 



B 



{PT{t',z) 



jdz^,p{z)e{pT{t'.z)~pT) 



to 



dt' 



-A{t,pT) 



OLsR 



2TTf 



(2.218) 



Looking back at Eq.(2.96 1 this corresponds to identifying B = (T„ and a^i? = (t„+i. In the POWHEG scheme the 
Sudakov factors are based on the simulated splitting probability UgR/B instead of the splitting kernels. This 
replacement is nothing new, though. We can already read it off Eq.( [2.96| l. 



2. 7 Further reading 



115 





MC@NLO/POWHEG matching 


CKKW/MLM merging 


hard jets 
collinear jets 
normalization 
implementations 


first jet correct 

all jets correct, tuned 

correct to NLO 

MC@NLO, POWHEG, SHERPA, HERWIG 


all jets correct 

all jets correct, tuned 

correct to LO plus real emission 

SHERPA, Alpgen, MadEvent,... 



Table 3: Comparison of the MC@NLO and CKKW schemes combining collinear and hard jets. 



A technical detail which we have not mentioned yet is that the POWHEG scheme assumes that our Sudakov factors 
can be ordered in such a way that the hardest emission always occurs first. Following the discussion in Section [2.5.2| 
we expect any collinear transverse momentum ordering to be disrupted by soft radiation, ordered by the angle. The 
first emission of the parton shower might well appear at large angles but with small energy, which means it will not be 
particularly hard. 

For the POWHEG shower this soft radiation has to be removed or moved to a lower place in the ordering of the 
splittings. The condition to treat soft emission separately we know from CKKW merging, namely Eq.( |2.190l ): the 
scale at which we resolve a parton splitting does not have to identical with the lower boundary of simulated splittings. 
We can construct a parton shower taking into account such splitting kernels, defining a truncated shower . This 
modified shower is the big difference between the MC@NLO scheme and the POWHEG scheme in combining 
next-to-leading order corrections with a parton shower. In the MC@NLO scheme we modify the next-to-leading order 
correction for a given shower, but the shower stays the same. In the POWHEG scheme the events get re-weighted 
according to standard building blocks of a next-to-leading order calculation, but the shower has to be adapted. 

In Sections [2. 5. 3| and Sections [2. 6. 2|2. 6.3] we have introduced different ways to simulate jet radiation at the LHC. The 
main features and shortcomings of the matching and merging approaches we summarize in Table |3] 
At this stage it is up to the competent user to pick the scheme which describes their analysis best. First of all, if there 
is a well defined and sufficiently hard scale in the process, the old-fashioned Monte Carlo with a tuned parton shower 
will be fine, and it is by far the fastest method. When for some reason we are mainly interested in one hard jet we can 
use MC@NLO or POWHEG and benefit from the next-to-leading order normalization. This is the case for example 
when a gluon splits into two bottoms in the initial state and we are interested in the radiated bottom jet and its 
kinematics. In cases where we really need a large number of jets correctly described we will end up with CKKW or 
MLM simulations. However, just like the old-fashioned parton shower Monte Carlo we need to include the 
normalization of the rate by hand. Or we are lucky and combined versions of CKKW and POWHEG, as currently 
developed by both groups, will be available. 

I am not getting tired of emphasizing that the conceptual progress in QCD describing jet radiation for all transverse 
momenta is absolutely crucial for LHC analyses. If 1 were a string theorist 1 would definitely call this achievement a 
revolution or even two, like 1917 but with the trombones and cannons of Tchaikovsky's 1812. In contrast to a lot of 
other progress in theoretical physics jet merging solves a problem which would otherwise have limited our ability to 
understand LHC data, no matter what kind of Higgs or new physics we are looking for. 

2.7 Further reading 

Just like the Higgs part the QCD part of these lecture notes is something in between a text book chapter and a review 
of QCD and mostly focused on LHC searches. Some corners I cut, in particular when calculations do not affect the 
main topic, namely the resummation of logarithms in QCD and the physical meaning of these logarithms. There is no 
point in giving a list of original references, but 1 will to list a few books and review articles which should come in 
handy if you would like to know more: 

- 1 started learning high energy theory including QCD from Otto Nachmann's book. 1 still use his appendices for 
Feynman rules because 1 have not seen another book with as few (if not zero) typos |28 1. 

- similar, but maybe a little more modern is the Standard Model primer by Cliff Burgess and Guy Moore At 
the end of it you will find more literature. 
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the best source to learn QCD at colliders is the pink book by Keith Ellis, James Stirling, and Bryan Webber It 
includes everything you ever wanted to know about QCD [24] and more. This QCD section essentially follows 
its Chapter 5. 

a little more phenomenology you can find in Giinther Dissertori, Ian Knowles and Michael Schmelling's 
book t30j . Again, I borrowed some of the discussions in the QCD section from there. 

if you would like to learn how to for example compute higher order cross sections to Drell-Yan production. 
Rick Field works it all out 1 3 1 1 . 

for those of you who are now hooked on QCD and jet physics at hadron colliders there are two comprehensive 
reviews by Steve Ellis 1321 and by Gavin Salam ll33l . 

aimed more at perturbative QCD at the LHC is the QCD primer by John Campbell, Joey Huston, and James 
Stirling Q. 

coming to the usual brilliant TASI lectures, there are Dave Soper's ll34l and George Sterman's ||35]| notes. Both 
of them do not exactly use my kind of notations and are comparably formal, but they are a great read if you 
know something about QCD already. More on the phenomenological side there are Mike Seymour's lecture 
notes Il36ll . 

the only review on leading order jet merging is by Michelangelo Mangano and Tim Stelzer OTl . The original 
CKKW paper beautifully explains the general idea for final state radiation, and I follow their analysis (2^. For 
other approaches there is a very concise discussion included with the comparison of the different models ll38l . If 
you want to know more, you will have wait for the review article which Frank Krauss and Peter Richardson 
promised to write for Journal of Physics G. 

to understand MC@NLO there is nothing like the original papers by Bryan Webber and Stefano Frixione ll39l . 

the POWHEG method is really nicely described in the original paper by Paolo Nason BOl . Different processes 
you can find discussed in detail in a later paper by Stefano Frixione, Paolo Nason, and Carlo Oleari Il4l1l . 

even though they are just hand written and do not include a lot of text it might be useful to have a look at 
Michael Spira's QCD lecture notes |42 | to view some of the topics from a different angle. 
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3 LHC Phenomenology 

While the two first parts of these lecture notes focus on Higgs physics and on QCD, biased towards aspects relevant to 
the LHC, they do not yet justify the title of the lecture notes. In addition, both introductions really are theoretical 
physics. The third section will reflect what many theorists working on LHC topics need to know. It goes beyond what 
you find in theoretical physics text books and is usually referred to as 'phenomenology' . 

This terms indicates that these topics are not really theoretical physics in the sense that they rely on for example field 
theory. They are not experimental physics either, because they go beyond understanding the direct results of the LHC 
detectors. Instead, they lie in between the two fields and need to be well understood to allow theorists and 
experimentalists to interact with each other. 

Sometimes, phenomenology has the reputation of not being proper theoretical physics. From these lecture notes it is 
clear that LHC physics is field theory, either electroweak symmetry breaking, QCD, or — not covered in these notes 
— physics models describing extensions of our Standard Model at the TeV scale. This chapter supplements the pure 
theory aspects and links them to experimental issues of the ATLAS and CMS experiments. Before we move into 



phenomenological territory, in Section 3.1 we remind ourselves of the perturbative field theory background of our 



calculations. In Section 3.2 we fill in some blanks from Section 1.4.3 1.6 and 2.5.3 We first discuss jets and how to 



link the asymptotic final states of QCD amplitudes, partons, to experimentally observed QCD objects, jets. Then, we 



turn to a current field of research, so-called fat jets. In Section 3.3 we introduce a particularly efficient way of 



computing transition amplitudes from Feynman rules, the helicity method. Most professional tools computing LHC 



cross sections or simulating LHC events use this method instead of squaring amplitudes analytically. Section 3.4 
discusses how to reconstruct particles which interact too weakly to be observed in LHC detectors. In the Standard 
Model those would be neutrinos, but as part of the LHC program we hope to find dark matter candidates that way. 



Finally, in Section 3.5 we very briefly discuss LHC uncertainties from a theory point of view. In the public arXiv 
version more sections will follow, hopefully triggered by LHC measurements challenging theorist and their 
simulations. 

3.1 Not using Feynman rules 

The first section in this phenomenology part is actually the opposite of what it should be; instead of covering a 
phenomenological topic we very briefly review the kind of formal calculation which phenomenologists take for 
granted — the computation of a transition amplitude from first principles. This short section is meant to work by 
itself, so while there are obvious links to calculations elsewhere in these lecture notes, they are not necessary to follow 
this part. 

When we compute transition amplitudes for collider like LEP with incoming electrons and positrons or LHC with 
incoming quarks and gluons, we usually combine building blocks defined by Feynman rules in a way which does not 
make it obvious that we are dealing with a field theoretical quantum calculation. For example, in Section 2.1 we 
compute the transition amplitude for the process e+e^ — > 7* — > through a photon, all starting from these Feynman 
rules. In this section, we will give a brief sketch of what we have to do to describe the quantum fields involved and to 
compute this transition amplitude without using Feynman rules. 

From theoretical mechanics we remember that there are several ways to describe a physical system and calculate the 
time evolution of the particles involved. One object to start from is the action as a function of one degree of freedom 
or field (j) 

J d^a;if((/),a^0) . (3.1) 

With X we usually denote the 4-vector including the time component {xq, x). The action has to be invariant under a 
variation dS = 0. This condition we can translate into the Euler-Lagrange equations 



'The term 'phenomenology' is borrowed from philosophy where it means exactly the opposite from what it means in physics. Originally, 
phenomenology is a school based on Edmund Husserl. Unlike other fields of philosophy their main interest are not observations but the actual nature 
of things. Doing exactly the opposite, physicist phenomenologists are theorists who really care about measurements. 
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We use the notation 9^ = d/dx^^. A convenient second parameter in addition to x is the conjugate momentum 
7r(x) — ld(\). With this coordinate we can define the third object which we can use to describe the dynamics of a 
system, the Hamiltonian density 

£-xn{x)= I (fix (tt^~^) . (3.3) 



While for example in quantum mechanics this Hamiltonian or energy functional is the basis of most calculations, in 
field theory we usually start from the Lagrangian. 

We already know that for our scattering process we need to compute a transition amplitude between two kinds of 
matter particles, namely incoming electrons and outgoing quarks, interacting via their electric charges. The interaction 
is classically described by the electromagnetic Lagrangian based on the abelian U{1) field theory 

With F^, ^ d^A, ~ d,A^ , (3.4) 
in terms of a photon vector field A^. The Maxwell equations 

= a^Fp. = d^d^A, - d'^d.A^ = DA, (3.5) 

are the equations of motion for this photon field. In the last step we assume the Lorentz gauge condition dfj,A'^ = 
and find the d' Alembert equation for the vector potential A^^. 

To avoid carrying the vector index of the photon field we instead use a real scalar field cj) to illustrate bosonic fields, 
how we quantize them, what its equation of motions are, and how it enters a calculation. Including a mass for this real 
scalar field we can write down its equation of motion which is the same for a spin-0 scalar boson as for the spin-1 



vector boson of Eq.( 3.5 i 



(□ + m^) 0(2;) = . (3.6) 

This Klein-Gordon equation is completely equivalent to the d' Alembert equation for the electromagnetic vector 
potential in Lorentz gauge. This equation of motion corresponds to a contribution to the Lagrangian of the form 

1 IT} 

i^D 0^ (3.7) 



which we can easily confirm using the Euler-Lagrange equation Eq.( 3.2 1. 
Under a Lorentz transformation 

x^" ^ x"" = A^',x'' 
(j){x) -> (jj'ix) = (j){A-^x) (3.8) 

of the d' Alembert operator and of the scalar field, both of which are invariants, the Klein-Gordon equation keeps its 
form in the new coordinates. It is a standard wave equation which we can solve using plane waves, which modulo 
prefactors gives us 

(27r)32A:o 

Complex conjugates a and a* are required for a real field (f). The value of /cq is given by the dispersion relation, which 
means that if (f) fulfills the Klein-Gordon equation it is /cq = fc^ + m^. The Fourier transform a therefore explicitly 
'only depends on k. Up to this point the field (/> is a real function of the space coordinates, i.e. it is not quantized. 

Because in the remaining lecture we will only use quantized operator valued fields we will not change notation at this 



^(^)-/f2^ (e--a*(fc) + e--a(fc)) . (3.9) 



point. Switching from a field function to a field operator (j) leaves the Klein-Gordon equation Eq.( 3.6 1 the same. 



except that now it constrains an operator (j){x) which for example we cannot simply commute with other operators. 



used to be a real field, so now it will be a hermitian operator 0' — (j). In the plane wave solution Eq.( 3.9 1 the 
coefficients a and (now) are promoted to operators as well. 



3. 1 Not using Feynman rules 
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The Hilbert space in which our system resides includes a vacuum state |0) which has to be normalized (0|0) = 1 and 
has zero energy and momentum. We can show that the on-shell state |fc) = a^{k)\0) is an eigenvector of the 
energy-momentum operator with eigenvalues k^, which means we can interpret as a creation operator for a state 
with 4-momentum /c^. Successive application of a^{kj) on the vacuum defines a basis of states with a varying number 
of states, defining the Fock space. Acting with the energy-momentum operator on a(fc)|0) gives us a negative energy 
eigenvalue, which means that we have to set a(fc)|0) = 0. The operator a{k) is an annihilation operator 
For operators it is crucial that we define their commutation relations. The basic property which we use to fix the 
commutators of a and is causality, namely that field configurations outside each other's light cone cannot affect 
each other and therefore their field operators commute 



or 



[0(a;)Xa;')]=O 
[0(a-)Xa;')]=O 



for {x - x'f < 

for t = t\ \x-x'\ > 



(3.10) 



The operator-valued form of Eq.(3.9 1 we can insert into the equal-time commutators {t — t' ,x x') of (j) and (p, 
where the latter allows us to vary the relative factor between a and and derive several independent relations. One of 
them reads 



= 



(p{x),(p{x) 
(Pk 



(3.11) 



<Pk' 



(27r)32fco (27r)32fc^ 

In this Fourier transform for free fco and k'^ the integral vanishes only if the integrand, and therefore all commutators 
under the integral vanish, at least as long as x 7^ x' . For x — x' the condition [4>{x^ t), 4>{x' , t)] = i5^{x — x') fixes the 
third commutation relations for bosonic creation and annihilation operators 

a\k),a^{k')\ = 

a(fc),a(fc')j = 

a(fc),a^(fc')j = (27r)32fco 5^{k~k') . (3.12) 

Using these commutators we can analyze specific configurations values of the kind (0|a(fc)a^(/c)|0), which do not 
vanish. It turns out that the integral defining the expectation value for the energy k^ in such a basis diverges. This 
problem we can solve ad hoc by postulating that sandwiched between vacuum states we only evaluate combinations 
where all annihilation operators a are moved to the right and all creation operators a) to the left 



: a) a 



a) a 



: a) : = 



aa : = aa 



aa^ : — a^ a 



(3.13) 



This normal ordering we can interpret as ordering the operators in a way which we can interpret and neglecting 
commutators which appear on the way, sandwiched between vacuum states, for example (0| [a, a^ |0). If we want a 
Hamilton operator to only give positive but finite energy states we need to include a time ordering into its definition. 
The Wick theorem links normal ordering and time ordering in a way which makes them identical as long as we only 
compute tree level leading order processes. 



Looking at the scattering process we want to evaluate we recognize that what we need to describe is the propagation 
of a boson generated in e+e^ annihilation to the point where is splits into a quark and anti-quark pair Of course, this 
photon we cannot assume to be a real particle on its mass shell, but we can still compute the appropriate propagator. 
We will see later that it is defined as a time ordered product of two field operators which for scalar fields reads 



iDpix - x') = {0\T {4>{x)(pix')) |0) 



(3.14) 



120 



3 LHC PHENOMENOLOGY 



The time ordered product of two operators is defined as 

T {A{x)B{x')) = 9{xo - Xa)A{x)B{x') + 0{x'o - xo)B{x')A{x) , (3.15) 

Time ordering and normal ordering are linked by the Wick theorem. To have our result ready later, we again use the 
operator version of Eq.( 3.9 1 to evaluate this combination 

(O|(^(x)0(x')|O) 



xo>x'„ 





d^k' 


(27r)32fco 


(27r)32fc^ 


(Pk 


d^k' 


(27r)32fco 


(27r)32fc^ 


d^k 


d^k' 


(27r)32fco 


(27r)32fc^ 


d^k 


d^k' 


(27r)32fco 


(27r)32fc^ 


d^k 


— ik{x — x' 



—ik'. 



'a(fc') |0) 



- (0| e^('=^^+'='^')a^(^)a^(fc') + e-'('=^-'='^')a(fc)a^(fc') |0) 



(27r)32fco 



g-»(/cx-/c'x')^0|a(fc)at(fc')|0) with (0|a^a^|0) = 

g-»(/c.-/c'x')^0|[a(fc)a^(fc')]|0) 

(0|0) with [a,a^] = {2Ti f2kn 5^{k-k') 



(27r)3 



2fco 



(3.16) 



In the integral fco is given by its the on-shell value, so for k — k' we also have fco = /cq. Under the assumption 

Xo > x'q the last ratio under the integral can be viewed as the result of a contour integration over the fco component, to 

allow us to write the propagator as a 4-dimensional integral over d^fc. This integral we discuss in detail in 

Section [2. 1.2| where we find in Eq.( |2.17| i 



2^ 



^-iko{xo-x'g) ^-ikg^'(xo-x'g) ^ikg^'{xo-x'g) 

dkoT^ , = 0{xa - x'q) —j— h 9{x'q - xq)- 



fc2 — iri^ + is 



2k 



(OS) 





2fc^°^^ 



(3.17) 



As in Eq.( 3. 16 1 but now explicitly denoted fc(,°^> = y/W + m? is the on-shell value. Using this result and slightly 
abusing our notation by now writing fco for the zero component of the 4-dimensional fc integration we can write 



{m^)<p{x'm 



xo>x'q 



d^k 

(2^ ' 

d^fc 
d^k 



^—ikQixQ — x'^) 
^0 



{xo - Xo) 



p-ika{xa-x'„) 
^ik[x — x ) 



-ik{x—x') 



+ ie 

1 

fc2 — + is 



(3.18) 



Similarly, we can show the same relation for xq < x'q, picking up the other theta function and then show for the 
combination of the two 

^ k -~ik{x-x') (^1) 



Df'(x-x') 



{2n) 



(3.19) 



This Feynman propagator is a Green function for the Klein-Gordon equation Eq.( 3.6 1, which we can explicitly 
confirm as 



{n + m'^)DF{x-x') 



d^k 

d^k 
d^k 



(-1) 



fc2 



2 I ™2\ ~ik-(x-x') 
+ 171)6 ^ ' 



(-1) 



fc2 



^'^i^-i^-^') ^ S\x ~ x') . 



(3.20) 
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This concludes our discussion of bosonic fields. Using a scalar field instead of a vector field we have shown how the 
field can be expressed in terms of creation and annihilation operators and what the commutation rules for the scalar 
fields as well as for the creation and annihilation operators are. A major problem arises when we sandwich these 
operators between vacuum states, which means that such insertions have to be normal ordered. In addition, the time 
ordered correlation function of two scalar fields is the Feynman propagator, i.e. an inverse of the Klein-Gordon 
equation defined over the entire position space. 

All these properties we will later use for the photon field. The photon is a vector field, where each of the 
components obey the Klein-Gordon equation. The commutation relations and the photon propagator will not change, 
but they will simply be dressed up by factors g^,^ where appropriate. For the propagator this generalization is strictly 



speaking gauge dependent, g'^" corresponds to Feynman gauge. Nevertheless, in Section 2.1.2 we show that 
additional terms from other gauge choices do not contribute to our scattering amplitude. 

Scalars and gauge bosons are not the only particle we find in nature. Matter particles, or fermions, like electrons or 
quarks have a different equation of motion and a different contribution to the Lagrangian. No matter how it looks, the 
equation of motion for fermion fields ip has to be invariant under a Lorentz transformation 

i;{x) ij'ix) = Ai/2 ij{A-'x) , (3.21) 

where A1/2 is a different representation of the Lorentz transformations. It is called spinor representation and we can 
define it using the four Dirac matrices 7^ with their anti-commutator Clifford algebra 

{7^7''} = 2g'^"ll , (3.22) 

The unit matrix has the same size as the 7 matrices. That we usually write them as 4 x 4 matrices has nothing to do 
with the number of — also four — 7 matrices. The explicit form of the 7^ matrices is not relevant because it never 
appears in actual calculations. All we need is a few trace relations arising from their commutators. A representation of 
the Lorentz algebra in terms of the Dirac matrices is 

S^"" = [7^7"] implying A1/2 - cxp (^-'-oj^^S^^''^ . (3.23) 

Using the transformation property Ajy^27^^i/2 — ^^ul'^ we can postulate an equation of motion for the fermions, the 
Dirac equation 

(i7^9^ - to) iPix) = {id - to) il:{x) = . (3.24) 

Of course, we need to check that this equation is stable under Lorentz transformations, keeping in mind that A1/2 
commutes with everything except for the Dirac matrices 

{i^d^ - m) ^j{x) ^ {t^^{A~^r^d, - to) Ai/2^(A-ia;) 

= A1/2A7/2 {ir{A-')\d, - to) Ai/27^(A-ix) 
= Ai/2 {iA-^^rM/2{A-')\d, ~ to) ^{A-'x) 
= Ai/2 {iA%Y{^-')\d, - to) i,{A-^x) 
= Ai/2 {ig.'Yd, ~ m) V(A-ix) 

= Ai/2 {i-f'^dy - to) ^{A-^x) = . (3.25) 

An interesting side remark is that we can multiply the Dirac equation with {—i'^^d^ — m) and obtain the 
Klein-Gordon equation {d^ + m'^)ijj — which will be useful when we construct a fermion propagator 
An additional problem is that for example to define a mass term in the Lagrangian we need to form Lorentz scalars or 
invariants out of the fermion fields tp. Naively, {ip^ip) would work if the Lorentz transformations in {ip^ Ay^Ai^2''P) 
cancelled. Unfortunately A1/2 is not a unitary transformation, which means we have to go beyond (V'^V')- One can 
show that the Dirac adjoint 

ip ^ Tp^j^ with ip i'A^^^ and ^pip ~^ V'V' (3.26) 
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has the correct transformation property. This allows us to write down the Lagrangian for a fermion field 

^ D - m)V' . (3.27) 

Just like for bosons we can show that this term produces the Dirac equation of motion. As a side remark, gauge 
interactions can be introduced into the Lagrangian via a covariant derivative 

^ D ^ (iD - m) V = V' («(^ + ie-QA) - m) V' = V' («(^ - m) V' + eQA^ ^jf"^ (3.28) 



Everything written above we could as well apply to classical fields. Just like in the bosonic case we need to define the 
Dirac field operator in terms of plane wave coefficients 



{2lTf 



spin 

where the fermion spin can be ±1/2. In the absence of any other constraints we have four generating operators, 
a, a\ b, . Acting on the vacuum a and b are annihilation operators, and a' , b^ are creation operators, a for particles 
and b for anti-particles. These operators only depend on the momentum 3-vector because the fourth component 
follows from the dispersion relation of the on-shell particles. The way we introduce the spinors u and v the same 
would hold for them, but there are instances where we use them also for off-shell states and we have to take into 
account their dependence on the complete momentum 4-vector. Again following causality we postulate the 
anti-commutation relations, for example at equal time t = t' 

{i;{x),i^{x')} = 0={jj{x),^{x')} 

{ V(a;), V'(a;')} = 7°<5^(x - f ) . (3.30) 

Trying the same thing with commutators simply does not work, as Michael Peskin nicely shows in his book. These 
anti-commutators we can hnk to anti-commutators for the creation and annihilation operators in momentum space 

='5r«(27r)^2fc° S^{k-k') 
|a,(fc), fe,(fc')} = for all other a^^' and b^^\ (3.31) 

provided we know the spin sums for the so-called spinors u and v and their Dirac adjoints 

Us{k)us{k) = k + m 

spin 

^Vs{k)vs{k) = k-m. (3.32) 

spin 

Strictly speaking, ifc is a (4 x 4) matrix, so in the mass term we omit the the corresponding unit matrix. To produce 

such a matrix u and v are 4-dimensional objects. 

These anti-commutator relation have the fundamental consequence that for two fermion states generated from the 
vacuum we have to keep track of the ordering 

|e-(fc,r)e-(fc',r')) - al{k)al,ik')\0) = -4,(fc')4W|0) = -\e- {k' ,r')e- ik,r)) . (3.33) 

This factor (—1) needs to be taken into account when we apply normal ordering to fermions. For k = k' and r = r' 
this impUes Pauli's exclusion principle: two identical fermion states cannot co-exist exactly in the same point. 
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Again, this is all we need to say about fermions to compute our electron-positron scattering process. We know the 
Dirac equation and the corresponding contribution to the Lagrangian, including the definition of the Dirac adjoint to 
construct Lorentz scalars. The quantized fermion field obeys anti-commutation relations, as do its creation and 
annihilation operators. To link them we need to know the form of the spin sums over the spinors u and i;. 

As one example that shows how to compute a transition amplitude without using Feynman rules we choose the 
scattering process 

e"(fci,si) + e+(A:2,S2) ^ S3) + 9(^2,54) , (3.34) 

where kj, pj and sj are the 4-momenta and spin orientations of the external fermions. Infinitely far in the future, i.e. 
asymptotically for t +00 the initial state limf_j._oo \t) = \i) has evolved into the final state limt^co |^) = via a 
yet unknown linear operator S. To describe a scattering process into a given abstract final state (/| we need to 
compute the transition amplitude 



S = {f\t ^ (X)) = {,f\S\t) = (9394|5|e+e2-) = {Ola^b^ S a\bl\0) 



(3.35) 



We use one index to indicate the momenta and spins of the external particles, for example in the creation and 
annihilation operators. 

S we can compute from the time evolution of the initial state idt\t) — H{t)\t) in the interaction picture with a time 
dependent Hamilton operator. The evolution equation then reads 



\t) / dt'H{t')\t') 



dt'mt') 



dt"H{t') 



dt"H{t")\t" 



dt"n{t") 



t" 



dt'" n{t"')\t" 



(3.36) 



just inserting the same evolution into itself twice. The problem with this form is that it still involves \t"') at the very 
end. What we instead want is something that is only proportional to We can achieve this when we look at the 
integration boundaries: the integration range becomes smaller in each step of primed variables. In the limit of 
infinitely many insertions the remaining integrals should be over less and less time, starting at —00. This means that 
neglecting higher powers of the Hamilton operator H or, as we will see later, neglecting powers of a coupling 
mediating the interaction between the states involves we can rewrite this form as 



00) = |z) + i-i) 
= K> + {-i) 

= V) + (-0 

= S\i) . 



dt"H{t') + 

) 

dt'Hit') b;> + (-i)' 

dt'H{t') + 



dt'n{t') / dt"n{t")\i) + o{n') 

J J —00 

dt'dt" e{t' - 1") H{t')n{t") \i) + oin'') 

) 
; 

dt'dt" T{n{t')n{t")) |z) + 0(^3) 



(3.37) 



where the time ordered product only contributes a factor two for two identical and hence commuting operators. The 
last line of Eq.( 3.37 1 with the time ordered Hamilton operators and the corresponding factor 1/(2!) is important 
because it means that we can sum S as an exponential series 



S 



(3.38) 



and ensure that it generates a unitary transformation. For our computation we will be fine with the quadratic term 
which we explicitly list. 
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The form of the interaction Hamiltonian for two fermionic currents each involving a different particle species j with 
charge Qj follows from the covariant derivative Eq.(3.28 1 

= - f d'x^U^) D -'Qo I d^^A^ : ^^-I'^i^, : , (3.39) 
J ^ J 

in terms of the 4-vector x including its first entry t ~ xq, the fermion current {ijj^^ip), and the photon vector field A^. 
The current is normal-ordered, i.e. annihilation operators a, h are moved to the right and creation operators a) , are 
moved to the left. For fermions a switch includes a minus sign, while for bosons the two operators are simply 
exchanged. 

To connect four creation and annihilation operators arising from the external states we need four such operators from 
S, which means the first term which will contribute to the scattering process is the quadratic term in T-L. The two 
Hamiltonians contributes one electron and one quark current. It is not hard to check that these two assignments give 
the same result, so we only follow one of them and include an additional factor two in the formula for 

^ = 2 X til! I dt'dt"{0\a3biT{n{t')nit"))a\bl\0) 



2 

= ~e'Q,Qg I d''x'd^x"{0\a3b^T {■.^^{x')j^%{x') : A^(x') : V^e(^")7.V'e(a;") : A^ix")) a\bl\0) 

= -e^QeQq j d^x'd^x"{Q\T{A^'(x')A''{x"))\Q) 

(0|a3&4T (: '^^{x')-i^^q{x') : : i^^{x"M,{x") :) b\a\\Q) . (3.40) 
The first of the time ordered product is a gauge boson propagator in Feynman gauge 

{0\T{A>^{x')A%x"))\0) = j ^ e^9(-'-")Z^ = ,5P-i^^(^'_^") . (3.41) 

The second time ordered product we still have to evaluate by properly combining the creation and annihilation 
operators with the fermion fields. For example, we can write 

(0|^(x)at(p)|0) = j ^^^(0| (e:^'^b\k)v{k)+e-^^'^a{k)u{k)) a\m 

spins 
spins 

^ e-^'^^uik) ((0| [a(fc)at {p)] |0) + (0|at (p)a(fc) |0) 

spins 

d^k 

{2n)^2E 



^e-*'=^M(/fc) (^{2TTf2E6^(k-p){0\Q)+0j 



spins 



="u(fc) ({2TTf2E6'^{k~p){0\{))+Q 



{2-KY2E 

= e-'P^ u{p) . (3.42) 

The normal ordering of the fermion currents in this case is never really needed after properly defining the interaction 
Hamiltonian. Correspondingly, we find the other non-zero normal-ordered combinations 

(0|6(p)V(a;)|0) = e^P^w(p) 
(0|a(p)V^(x)|0) = e*P^u(p) 

(0|V^(a:)6^(p)|0) = e-'P^ v{p) . (3.43) 
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All other combinations of a, a\ b, with V' and vanish when we sandwich them between vacua. Before we 
contract the four creation and annihilation operators we need to keep in mind that electromagnetic currents only link 
one set of particle, they do not convert quarks into electrons. This limits the number of permutations we need to take 
into account. We find one unique non-vanishing combination of external states and current creators and annihilators, 
namely matching a^tp and 64-0 for the quarks and tpb\ as well as ipa\ for the electrons. 

(010364 : i^gix'hM^') ■■ ■■ ^ei^'ll^^M^l ■■ 4bl\0) = e'^P'+P''>^'u3^^V4 e-'('=i+'==)^"i;27."i • (3.44) 
Combining the different contributions we find 

(27r)4 g2 



(2^)4 --<^^"-q2 

/d^ 1 
77m "37m^4^W27^"i ^"^{q + ki+ k2)5'^{-q - Pi - P2) 
(27r)4 

= z(27r)^(5''(fci + fc2 - _pi - P2) e^QeQq u^j^Vij- — .y t'27^'»i • (3-45) 

Stripping off unwanted prefactors we can define the transition matrix element for quark anti-quark production in QED 

as 

M = e'^QeQq (u37m"4)7, — rT-^i^2l'"ui) , (3.46) 

(fci + fc2)^ 

with (fci + fc2)^ = {pi + P2)^- This matrix element or transition amplitude we have to square to compute the 
transition probability. Part of the squaring is the sum over all spins which uses the spin sums Eq. p.32| l to get rid of the 
spinors and then some trace rules to get rid of all Dirac matrices. Neither for the spinors nor for the Dirac matrices we 
need to know their explicit form 

l-^P = ^'^QlQl /fc +fc )4 (^47!^W3)(W37^1;4) {Ul-i''v2) {V2YU1) 

spin, color ^ 

=e^QlQlNc Y X!('^4>'"3)("37m«4) {ui-i''v2){v2'y^ui) 

^ spin 

=e'^QlQlNc ,^ )4 X!('^4)i(7i^)«j("3)j(w3)fe(7A<)fei(«4)/ • • • for one trace 

^ spin 

^e^QlQlN, I \Y.{vMvAh\ (E("3),(^3)J (7.).,(7m)a 



(fcl 


+ k2Y 
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(fcl 


+ k2Y 
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(fcl 


+ k2Y 




1 


(fcl 


+ k2Y 




1 


(fcl 


+ k2Y 




1 



)kl ■ 



spin / \ spin 

„4^2^2 ^ 



--e^QiQ'^N, _^ ^^^^ {Pih{7.h{p3)Alt^)ki ■ ■ ■ using Eq.([332i 

--e^QlQlN, Tr {Pa.P^lt.) Tr iPi-f''p27n both traces again 



=e'^QlQlNc ^ 4 {pauPi^L + Pst^Piu - 9^iy{piP2)) 4 (fc^ fc^ + fc^'fca - gf,u{kik2)) 

n2 
1 



" (fcl+fc2r 

(fcl + k2Y 

=i2e^QlQlN, [(fciPi)(fc2P2) + (fciP2)(fc2Pi)] , (3.47) 



=lQe^QlQlN, 1^ ^^^^^ [2(fciPi)(fc2P2) + 2(fciP2)(fc2Pi) + x (piP2)(fcifc2)] with 5^,5'"- = 4 

1 



(fcl + fc: 



2 



In the fourth line we have used a know expression for the Dirac trace. More complicated and longer traces become 
very complicated very fast and we use Form to evaluate them on the computer The color factor Nc is the number of 
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color single states we can form out of a fundamentally charged quark and an anti-quark with the opposite color 
charge. Because color only appears in the final state we sum over all possible color states, i.e. multiply with Nc- This 
result for the matrix element and the matrix element squared is the same expression as we get starting from the usual 



Feynman rules in Eq.(2.8 i 



Feynman rules are calculational rules which we can extract from the Lagrangian. These building blocks representing 



external and internal particles we combined to construct A4. To compute the matrix element in Eq.(3.44i skipping 
everything we did to get this formula we start drawing a set of Feynman diagrams which represent all ways we can 
link a given initial state to a given final state via a set of interaction vertices and internal propagators. For qq 
production in e+e~ scattering described by QED there exists only one such diagram 



It consist of four external fermions, one internal photon, and two interaction vertices. From looking at Eq.( 3.44| i we 
know how to describe external fermions in terms of spinors: 



symbol 


diagram 


u(p, s) 


incoming 


; fermion (e^ , q) with momentum p and spin s 


v(p,s) 


incoming 


I anti-fermion (e+, q) 


u{p, s) 


outgoing 


fermion (e~, q) 


v{p, s) 


outgoing 


anti-fermion (e+, q) 



Spin sums are the only way to get rid of spinors in the computation. Eq.( 3.32 1 shows that as long as we neglect 
fermion masses the two spinors u and v for particles and anti-particles are identical. To link external particles to each 
other and to internal propagators we need vertices. In Eq. p.44[ ) we see that two fermions and a gauge boson interact 
via a vector current proportional to 7^*. As a convention, we add one factor i, so the vertex rule in QED becomes 



(3.48) 



This factor i we can change, as long as we consistently change it for all 3-point and 4-point vertices in our theory. 
Finally, there is the intermediate photon which propagates between the 7^ and the 7"^ vertices. The wave line in the 
Feynman diagram corresponds to 



p^ + ie 

where again the factor —i is conventional. For a bosonic propagator it does not matter in which direction the 



(3.49) 



momentum flows. Blindly combining these Feynman rules gives us directly Eq.(3.44 1, so all we need to do is square 
the matrix element, insert the spin sums and compute the Dirac trace. 

Whenever we compute such a matrix element starting from a Feynman diagram nothing tells us that the lines in the 
Feynman diagrams are not actual physical states propagating from the left to the right. Even including loop diagrams 
will still look completely reasonably from a semi-classical point of view. Feynman rules define an algorithm which 
hides all field theory input in the calculation of scattering amplitudes and are therefore perfectly suited to compute the 
differential and total cross sections on the computer. 



The vector structure of the QED couplings, for example mediated by a covariant derivative Eq.( 3.28 1 we did not 
actually motivate. It happens to work on the Lagrangian level and agree with data, so it is correct. We can write a 
completely general interaction of two fermions with a boson in terms of basis elements 



basisj 



(3.50) 
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For a real (4 x 4) matrix M the necessary 16 basis elements we would like to write in a way that makes it easy to keep 
track of the Lorentz transformation properties. The vector 7^ from the QED interaction gives us four such basis 
elements, the unit matrix a fifth. Another six we already know as well, they are the generators of the spinor 
representation [7^, 7"^]. We can check that all of them are linearly independent. Five basis elements in a handy form 
are still missing. 

To define them, we need to know that there exists another (4 x 4) matrix which is invariant under proper Lorentz 
transformations. We can write it in terms of the four Dirac matrices in two equivalent forms 

75 = *7°7V'7'7' = ^ W'.7^7"7''7" , (3.51) 

using the totally anti-symmetric Levi-Civita tensor e^j^pg.. This form, by the way, already shows a major technical 
complication in dealing with 75: in other than four space-time dimensions we do not know how to define the 
Levi-Civita tensor, which means that for example for regularization purposes we cannot analytically continue our 
calculation to n = 4 — 2e dimensions. The main property of 75 is equivalent to that fact that it is another basis element 
of our (4 X 4) matrices, it commutes with the other four Dirac matrices [7^, 75] ~ 0. Combining this new object as 
(7^75) ™d 175 gives us all 16 basis element for the interaction of two spinors with a third scalar, vector, or tensor 
field: 





degrees of freedom 


basis elements Mj 


scalar 


1 


1 


vector 


4 




pseudoscalar 


1 




axial vector 


4 


7'' 75 


tensor 


6 


^[7^71 



The field indices need to contract with the indices of the object iJjMip. Again, the factors i are conventional. In the 
Standard Model as a fundamental theory tensor interactions do not play a major role. The reason is the dimensionality 
of the Lagrangian. The mass dimension of a fermion field ^ or is m'^/^ while the mass dimension of a scalar field, a 
photon field, or a derivative is m. For example from Eq.( |3.28| l we see that every term in the QED Lagrangian is of 
mass dimension four This is required for a renormalizable fundamental field theory. Introducing a tensor coupling we 
have to contract two indices, /i and i^, and not with the metric tensor. The only other objects coming to mind have 
mass dimension m, which means that together with the fermion fields the term in the Lagrangian has mass dimension 
of at least rn^ and is therefore not allowed. 



The second obvious question is: what does it mean to include a factor 75 in the interaction, i.e. what distinguishes a 
scalar from a pseudoscalar and a vector from an axialvector? An easy answer we can give by defining three 
transformations of our field in space and time. The first one is the parity transformation P which mirrors the three 
spatial coordinates {t, x) {t, —x). The second is charge conjugation C which converts particles into their 
anti-particles. Both of them leave the Dirac equation intact and be represented by a unitary transformation. The third 
transformation is time reversal T which converts {t, x) — > (— x), also leaves the Dirac equation intact, but only has 
an anti-unitary representation. Every single one of them is violated in our Standard Model. 
Instead of writing out the representation of these transformations in terms of Dirac matrices we characterize them 



using the basic interactions from Eq.(3.50 1. Parity symmetry does not allow any interaction including 75, i.e. it 
forbids the pseudoscalar and the axialvector Time reversal symmetry does not allow any complex couplings gj . 
Because any field theory described by a Lagrangian not including some kind of external field is invariant under CPT 
(and we have never observed CPT violation) a combined CP invariance is essentially the same as T invariance. 

To again look at the same question we rotate the {1, 75} plane and define the two (4 x 4) matrix valued objects 

Pi?,L - ^ (11±75) . (3.52) 
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It is easy to show that the two are orthogonal 



'lVr =1(^-15) (1+75) = ^ (1-7|) 



and projectors 



T>2 
R,L 



(21 ± 275 



using 75 



(]1±75) =Pfl,L 



(3.53) 



(3.54) 



In QED these combinations do not play any role. Their effect on kinetic and mass terms we compute in 

Eqs.( 1.12|1.13 1. Looking at interactions, we can for example define a combined vector-axialvector coupling as 

ryfj- -± 7/^75 = 27^P/j i. Sandwiching this coupling between fermion fields gives for example 



^7^PiV = V^7X^ 

= ?/>1-Pi7o7^PiV' with 

= (PLV^)So7MPi^ with 

= V'l7mV'l with 



{75,7a.} = 
75 =75 



(3.55) 



If we call the eigenstates of P/? ^ right handed and left handed fermions iJjl.r. and this property the chirality of a 



fermion the form of the coupling in Eq.(3.55 1 shows that we can define a vector coupling between only left handed 
fermions by combining the vector and the axialvector couplings with a relative minus sign. The same is of course true 



for right handed couplings. In Section 1.1.2 we show that kinetic terms can also be defined independently for left and 
right handed fermions, while mass terms or scalar interactions mix the two chiralities 



(3.56) 



In other words, we can write for example QED in terms of independent left and right handed fields as long as we 
neglect all fermion masses. This is the chirally symmetric limit because the Lagrangian is symmetric under t/j^ O ipR- 
Introducing fermion masses breaks this chiral symmetry, or turning the argument around, to introduce fermion masses 
we need to combine a left handed and a right handed fermion fields and give them one common Dirac mass . 



At this stage it is not obvious at all what chirality means in physics terms. However, we will see that in the Standard 
Model the left handed fermions play a special role: the massive W bosons only couple to them and not to their right 
handed counter parts. So chirality is a property of fermions known to one gauge interaction of the Standard Model as 
part of the coiTesponding chai-ge. 



There exists a property which is identical to chirality for massless fermions and has an easy physical interpretation: 
the helicity. It is defined as the projection of the particle spin onto its 3-momentum direction 



h = s- [s + L] ■ J ■ with p -L L , (3.57) 

\p\ \ / IpI \p\ 

or equivalently the projection of the combined orbital angular momentum and the spin on the momentum direction. 
From quantum mechanics we know that there exist discrete eigenvalues for the z component of the angular 
momentum operator, symmetric around zero. Applied to fermions this gives us two spin states with the eigenvalues of 
h being ±1/2. Unfortunately, there is no really nice way to show this identity. What we need to know is that the spin 
operator is in general given by 

5 = 757" 7- (3.58) 

We can show this by writing it out in terms of Pauli matrices, but we will skip this here and instead just accept this 
general form. We then write the solution ij) to the massless Dirac equation after transforming it in momentum space 
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ijj{x) — u{p) exp(— ip • x) 
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In other words, the chirality operator 75 indeed gives us the helicity of the particle state, modulo a sign depending on 
the sign of the energy. For the helicity it is easy to argue why for massive particles this property is not Lorentz 
invariant and hence not a well defined property: massless particles propagate with the speed of light, which means we 
can never boost into their rest frame or pass them. For massive particles we can do that and this way switch the sign of 
p and the sign of h. Luckily, for almost all Standard Model fermions we can in many situations neglect the fermion 
mass. 



3.2 Jets and fat jets 

Throughout Section|2]we pretend that quarks and gluons produced at the LHC are actually what we observe in the 
LHC detectors. In perturbative QCD those particles are the final states, even though they cannot exist individually 
because QCD is asymptotically free. For example, in Eq.( |2.63| l we apply wave function renormalization factors to 



them. On the other hand, in Section 2.2.2 we see that the strong coupling explodes at low energy scales around Aqcd 
which means that something has to happen with quarks and gluons on their way through the detectors. Indeed, the 
gluon and all quarks except for the top quark hadronize before they decay and form bunches of baryons and mesons 
which in turn decay in many stages. These particles carry a lot of energy, around the typical energy scales 0{mH) we 
are interested in at the LHC. Relativistic kinematics then tells us that these baryons and mesons are strongly boosted 
together to form jets. Those jets we measure at hadron colliders and link to the partons produced in the hard 
interaction. 



Consequently, in Section|2]we use the terms parton and jet synonymously, essentially assuming that each parton at the 
LHC turns into a jet and that the measured jet four-momentum can be linked to the parton four-momentum. The way 
we usually define jets are so-called recombination algorithms including for example the Cambridge-Aachen or (anti-) 

algorithms. Imagine we observe a large number of energy depositions in the ATLAS or CMS calorimeter which 
we would like to combine into jets. We know that they come from a small number of partons which originate in the 
hard QCD process and which since have undergone a sizeable number of splittings, hadronized and decayed to stable 
particles. Can we try to reconstruct the original partons? 

The answer is yes, in the sense that we can combine a large number of proto-jets into smaller numbers, where 
unfortunately nothing teUs us what the final number of jets should be. We know from Section|2]that in QCD we can 
produce an arbitrary number of hard jets in a hard matrix element and another arbitrary number via soft or collinear 
radiation. Therefore, we need to tell the jet algorithm either how many jets it should arrive at or what should be the 
resolution of the smallest proto-jets we consider partons, whatever the measure for this resolution might be. In the 
coming section we will therefore discuss what criteria for a proto-jet recombination to an assumed physical jets exist. 



3.2.1 Jet algorithms 

The basic idea of recombination algorithms is to ask if a given proto-jet has a soft or collinear partner This follows 
from the physics arguments in Section|2]from which we know that partons produced in a hard process preferably turn 
into collinear pairs of partons as approximately described by the parton shower To decide if two proto-jets have arisen 
from one parton leaving the hard process we have to define a coUinearity measure. This measure will on the one hand 
include the distance in R space as introduced in Eq.(2.35 1 and on the other hand the transverse momentum of one 
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proto-jet with respect to another or to the beam axis. ExpUcit measures are, weighted by the relative power of the two 
ingredients 
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(3.60) 

The parameter is a measure of the angular size of the jet and can range anywhere between 0.4 and 1.5. It balances 
the jet-jet and jet-beam criteria; small values of D increase yij, so yij > UiB becomes more likely. Because of the 
power dependence on the transverse momenta the three algorithms start with soft constituents {kx), purely geometric 
(Cambridge-Aachen C/A) or hard constituents (anti-fc^) to form a jet. All of them have in common that they link 
physical objects, namely calorimeter towers, to other more or less physical objects, namely partons from the hard 
process. As we can see from the different choices in Eq.( 3.60| l we have all the freedom in the world to weight the 



angular and transverse momentum distances relative to each other While for the fcy and the C j A algorithms it is 
fairly clear that the intermediate steps have a physical interpretation, this is not clear at all for the anti-Zcy algorithm. 



We define two proto-jets as coming from one jet if yij < y^m, where j/cut is the scale we give to the algorithm. The jet 
algorithm proceeds as 

(1) for all combinations of two proto-jets in the event find the minimum y™'" — mmij{yij,yiB) 
(2a) if y™'" = yij < y^ut merge proto-jets i and j and their momenta, go back to (1) 
(2b) if y™'" — yiB < y^ut remove proto-jet i, call it beam radiation, go back to (1) 
(2c) if y™'" > ycut keep all proto-jets, done 

The result of the algorithm will of course depend on the resolution y^m- Alternatively, we can give the algorithm the 
minimum number of physical jets and stop there. The technical question is what 'combine jets' means in terms of the 
four-momentum of the new jet. The three-momentum vectors we simply add fcj + kj ki. For the zero component 
we can assume that the new physical jet have zero invariant mass, which is inspired by the massless parton we are 
usually looking for. If instead we add the four-momenta we can compute the invariant mass of the jet constituents, the 
jet mass. As we will see in the next section this allows us to extend the concept of jet algorithms to massive particles 
like aW or Z boson, the Higgs boson, or the top quark. 

From Section|2]and the derivation of the collinear splitting kernels it is obvious why theorists working on perturbative 
QCD often prefer the fc^ algorithm: we know that the showering probability or the collinear splitting probability is 
best described in terms of virtuality or transverse momentum. A transverse momentum distance between jets is 
therefore best suited to combine the correct proto-jets into the original parton from the hard interaction, following a 
series of actual physical intermediate splittings. Moreover, this transverse momentum measure is intrinsically infrared 
safe, which means the radiation of an additional soft parton cannot affect the global structure of the reconstructed jets. 
For other algorithms we have to ensure this property explicitly, and you can find examples in QCD lectures by Mike 
Seymour. 

The problem of the kr algorithm arises with pile-up or underlying event, i.e. very soft QCD activity entering the 
detectors undirectionally. Such noise is easiest understood geometrically in a probabilistic picture. Basically, the 
low-energy jet activity is constant all over the detector, so we subtract it from each event . How much energy deposit 
we have to subtract from a reconstructed jet depends on the area the jet covers in the detector Therefore, it is a major 
step that even for the kx algorithm we can compute an IR-safe geometric jet size. The C/A and anti-fcy algorithms are 
more compact and easier to interpret experimentally. 
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3.2.2 Fat jets 

Starting from the way the experiments at the Tevatron and the LHC search for bottom jets, including several detailed 
requirements on the content of such jets, the question arises if we can look for other heavy objects inside a jet. Such 
jets involving heavy particles and (usually) a large geometrical size are referred to as fat jets. Three main motivations 
lead us into the direction of fat jets: first, dependent on our physics model heavy objects like W bosons or top quarks 
might be boosted enough to fit into a regular jet of angular size R < 0.7. Secondly, a jet algorithm might include 
hadronic decay products which we would not trust to include in a regular mass reconstruction based on reconstructed 
detector objects. And finally, even if only a fraction of the heavy particles we are searching for are sufficiently boosted 
such an algorithm automatically resolves signal combinatorics known to limit LHC analyses. 
At the LHC the experimental situation px /m > 1 we are guaranteed to encounter for electroweak gauge bosons, 
Higgs bosons, and top quarks. The more extreme case of px 3> m, for example searching for top quarks with a 
transverse momentum in excess of 1 TeV is unlikely to appear in the Standard Model and will only become interesting 
if we encounter very heavy resonances decaying to a pair of top quarks. This is why we focus on the moderate 
scenario. 



Historically, fat jet searches were first designed to look for strongly interacting W bosons. Based on the kx algorithm 
they look for structures in the chain of y values introduced in Eq.( 3.60 1, which define the kinematics of each jet. For 
such an analysis of y values it is helpful but not crucial that the intermediate steps of the jet algorithm have a physics 
interpretation. More recent fat jet algorithms looking for not too highly boosted heavy particles are based on the C/A 
algorithm which appears to be best suited to extract massive splittings inside the jet clustering history. A comparison 
of different jet algorithms can be found in the original paper on associated Higgs and gauge boson production. Using a 
C/A algorithm we can search for hadronically decaying boosted W and Z bosons. The problem is that for those we 
only have one hard criterion based on which we can reject QCD backgrounds: the mass of the W/Z resonance. 
Adding a second W/Z boson and possibly the mass of a resonance decaying to these two, like a heavy Higgs boson, 
adds to the necessary QCD rejection. For Higgs and top decays the situation is significantly more promising. 



R,-, ^ , ^ ^ > , (3.61) 



Starting with the Higgs tagger we search for jets which include two bottom quarks coming from a massive Higgs 
boson with rriH ^ 120 GeV. First, we run the C/A algorithm over the event, choosing a large geometric size R = 1.2 
estimated to cover 

1 m/f 2m// 

\/z{l - z) Pt,h Pt,h 

in terms of the transverse momentum of the boosted Higgs and the momentum fractions z and 1 — z of the two bottom 
jets. 

This fat jet we then uncluster again, searching for a drop in jet mass indicating the decay of the massive Higgs to two 
essentially massless quarks. The iterative unclustering we start by undoing the last clustering of the jet j, giving two 
subjets ji, j2 ordered such that mj^ > nij^. If the mass drop criterion nij-^ > 0.8 nij is fulfilled, i.e. j2 is soft enough 
to come from the underlying event or soft or collinear QCD emission, we discard j2 and keep ji, otherwise we keep 
both ji and j2; each subjet ji that is kept we further decompose recursively until it reaches some minimum value, e.g. 
rrij- < 30 GeV ensuring it does not involve heavy states. This way we obtain a splitting pattern which should only 
include massive splittings and which for the Higgs jet uniquely identifies the H bb decay. Making use of the scalar 
nature of the Higgs boson we can add an additional requirement on the balance based on min{p^j_^ , p^j^)AR^_^j^ . Of 
course, all actual numbers in this selection are object to experimental scrutiny and can only be determined after testing 
the algorithm on LHC data. 



Experimentally, the goal of such a Higgs search is a distribution of the invariant mass of the bottom quarks which 
gives us a signal peak and side bins to estimate the background. However, applying jet algorithms with very large R 
size makes us increasingly vulnerable to underlying event, pile-up, or even regular initial state radiation as described 
in Section 2.3.2 Therefore, we cannot simply use the mass of a set of constituents of the fat jet. Instead, we apply a 
filtering procedure looking at the same constituent with a higher resolution which can for example be 
i?fiit — min(0.3, Rf£/2). This filtering significantly reduces the y-(t) surface area of the relevant constituents and 
thereby the destructive power of the underlying event and pile-up. The invariant mass we include in the histogram is 
the mass of the three hardest constituents, the two bottom quarks and possibly a radiated gluon. 
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Figure 14: Invariant mass distributions for Higgs searches using fat jets from iJ — > 66 decays. For a Standard Model 
Higgs boson the production mechanisms are pp — > WH/ZH (left) and pp tiH (center). In cascade decays of 
supersymmetric squarks and gluinos we can apply the same search for the light Higgs boson (right). Figures from 
Refs. LSJ, t43J and [|44| (left to right). 



In a busy QCD environment another problem arises: errand jets from initial state radiation or other particles in the 
final state enter the fat jet algorithm and give us several mass drops in the fat jet history. To avoid shaping the 
background side bins we can include several (filtered) subjet combinations, ordered in the modified Jade distance 
PT,iPT,2(Ai?i2)"' — the original Jade distance includes is given by px .iPt ,2{^Ri2)^ ■ Examples for the invariant 
mass distributions for different Higgs search channels we show in Figure [14] They include Standard Model Higgs 
searches in WH/ZH production, in tiH production, and light supersymmetric Higgs searches in the decays of 
squarks and gluinos. 

From the above discussion we see that Higgs taggers rely only on one kinematic criterion , the mass of the hh pair. In 
terms of background rejection this is not much, so we usually add two bottom tags on the constituents which 
according to detector simulations can be very efficient. The two combined add to a QCD rejection of at least 10^'', 
which might even allows us to run a Higgs tagger over any kind of event sample and see if we find any Higgs bosons 
for example in new physics decays. 

While fat jet Higgs searches are targeted mostly at the Standard Model, looking for other boosted heavy particles is 
usually motivated by new physics scenarios. Looking for massive particles decaying to quarks top taggers are next in 
line. Starting from a C/A jet of size up to i? 1.5 we again search for mass drops, this time corresponding to the top 
and W masses. After appropriate filtering we apply these two mass windows conditions; the entire fat jet has to 
reproduce the top mass. Next, we require a mass drop corresponding to the W decay and effectively constrain a 
second combination of two decay jets evaluating the helicity angle of the left handed W decay. Instead of these two 
distinct steps we can also apply a two-dimensional condition on the kinematics of the three top decay products which 
avoids assigning the two W decay jets in cases where two combinations of decay jets give similar invariant masses. 
On the simulation level both methods give equivalent results. 

Applying these three kinematic conditions for example in the HEPTopTagger implementation gives a QCD rejection 
of a few per-cent. If this should not be sufficient for a given analysis we can increase the rejection rate by requiring a 
bottom tag which as a bonus also tells us which of the three top decay jets should reconstruct the W mass. When we 
use top taggers to look for new particles decaying to top quarks we are not only interested in finding boosted top 
quarks, but we would like to know their invariant mass. This means we would like to reconstruct their direction and 
their energy. Such a reconstruction is possible in a reasonably clean sample without too many QCD jets, i.e. provided 
the top quarks have large enough energy to boost all three decay jets into a small enough cone. 

While it seems like the C/A jet algorithm with its purely geometric selection has the best potential to search for 
massive particles in its jet history there exists a multitude of algorithms searching for boosted top quarks. Once the top 
quarks have very large transverse momenta the two-step mass drop criterion becomes less critical because the three 
decay jets are too close to be cleanly resolved. In this situation analyses based on the kx or anti-fcy algorithms can be 
as promising, even event shapes without any jet algorithm can search for such signals. 



3.3 Helicity amplitudes 
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3.3 Helicity amplitudes 



In Section [2T| we compute the cross section for Z production by writing down all external spinors, external 
polarization vectors, interaction vertices and propagators and squaring the amplitude analytically. The amplitude itself 
inherits external indices for example from the polarization vectors, while \A4 \ is a real positive number with a fixed 
mass dimension depending on the number of external particles. 

For the LHC nobody calculates gamma matrix traces by hand anymore. Instead, we use powerful tools like Form to 
compute traces of Dirac matrices in the calculation of In this section we illustrate that we can do even better. 

As an example, we consider our usual toy process 

Mu — > 7* — > /i^/i^ . (3.62) 

The structure of the amplitude A4 with two internal Dirac indices /i and v involves one vector current on each side 
{ufjfj,Uf) where f — u, ^ are massless fermions, so we do not have to be careful with the different spinors u and v. 
The entries in the external spinors are given by the spin of the massless fermions obeying the Dirac equation. For each 
value of /i = • • • 3 each current is a complex number, computed from the four component of each spinor and the 



respective 4x4 gamma matrix 7^ shown in Eq.(2.105 1. The intermediate photon propagator has the form g^i^/s, 
which is a real number for each value of fi = v. Summing over /i and ly in both currents forms the matrix element. To 
square this matrix element we need to sum Ai* x A4 over all possible spin directions of the external fermions. 

Instead of squaring this amplitude symbolically we can follow the steps described above and compute an array of 
numbers for different spin combinations numerically, just inserting the correct values for each component of each 
spinor Summing over the internal Dirac indices we compute the matrix element; summing over external fermion spin 
directions we compute the matrix element squared. This approach we illustrate for muon pair production based on the 
MadGraph/Helas implementation. 

MadGraph as part of MadEvent is a tool to compute matrix elements this way. It automatically produces a Fortran 
routine which calls functions from the Helas library to compute spinors, polarization vectors, currents of all kinds, etc. 



For our toy process Eq.( 3.62 1 the MadGraph output reads 



REAL*8 FUNCTION UUB_MUPMUM ( P , NHEL) 

C 

C FUNCTION GENERATED BY MADGRAPH 

C RETURNS AMPLITUDE SQUARED SUMMED/AVG OVER COLORS 

C FOR PROCESS : u u" -> mu+ mu- 

C 

INTEGER NGRAPHS, NEIGEN, NEXTERNAL 

PARAMETER {NGRAPHS= 1,NEIGEN= 1 , NEXTERNAL=4 ) 
INTEGER NWAVEFUNCS , NCOLOR 

PARAMETER (NWAVEFUNCS= 5, NCOL0R= 1) 

REAL*8 P (0 : 3, NEXTERNAL) 
INTEGER NHEL (NEXTERNAL) 

INCLUDE 'coupl.inc' 



DATA 


Denom (1 ) / 




1/ 






DATA 


(CF(i,l ),i = l 


, 1 ) 


/ 3/ 






CALL 


IXXXXX (P (0, 1 


) , ZERO 


, NHEL (1 


) ,+1, 


W(l,l )) 


CALL 


OXXXXX (P (0, 2 


) , ZERO 


, NHEL (2 


) ,-1, 


W(l,2 )) 


CALL 


IXXXXX (P (0, 3 


) , ZERO 


, NHEL (3 




W(l,3 )) 


CALL 


OXXXXX (P (0, 4 


) , ZERO 


, NHEL (4 


) ,+1, 


W(l, 4 ) ) 


CALL 


JIOXXX (W (1, 1 


) , W (1, 2 


) , GAU , 


ZERO 


, ZERO 


CALL 


lOVXXX (W (1, 3 


) , W (1, 4 


) ,W{1, 5 


) , 


GAL , AMP ( 1 


JAMP 


1) = +AMP ( 


1) 








DO I 


= 1, NCOLOR 











,w (1, 

) ) 



DO J = 1, NCOLOR 

ZTEMP = ZTEMP + OF ( J, I) * JAMP ( J ) 
ENDDO 

UUB_MUPMUM =UUB_MUPMUM+ZTEMP*DCONJG (JAMP (I) ) /DENOM (I) 
ENDDO 
END 



The input to this function are the external four- momenta p{0 : 3, 1 : 4) and the helicities of all fermions rihei(l ■ 4) in 
the process. Remember that helicity and chirality are identical only for massless fermions because chirality is defined 
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as the eigenvalue of the projectors (1 ± 75)72, while helicity is defined as the projection of the spin onto the 
momentum direction, i.e. as the left or right handedness. The exact definition of these two properties we give in 
Section ??. The entries of ni,ei will be either +1 or —1. For each point in phase space and each helicity combination 
the MadGraph subroutine SUUBJMUPMUM computes the matrix element using standard Helas routines . 

• IXXXXX(p, m, rihei, risf, F) computes the wave function of a fermion with incoming fermion number, so either 
an incoming fermion or an outgoing anti-fermion. As input it requires the four-momentum, the mass and the 
helicity of this fermion. Moreover, risf = +1 marks the incoming fermion u and risf = —1 the outgoing 
anti-fermion because by convention MadGraph defines its particles as u and 

The fermion wave function output is a complex array F(l : 6). Its first two entries are the left-chiral part of the 
fermionic spinor, i.e. ^'(l : 2) = (1 - 75)72 u or F{1 : 2) = (1 - 75)72 v for risf = ±1. The entries F{3 : 4) 
are the right-chiral spinor These four numbers can directly be computed from the four-momentum if we know 
the helicity. The four entries correspond to the size of one 7 matrix, so we can compute the trace of the chain of 
ganuna matrices. Because for massless particles hehcity and chirality are identical our quarks and leptons will 
only have finite entries F{1 : 2) for Uhei = —1 and F(3 : 4) for nhei = +1. 

The last two entries of F contain the four-momentum in the direction of the fermion flow, namely 

F(5) = n,f(p(0) + zp(3)) and F(6) = n,f{p{l) + ip{2)). 

■ OXXXXX(p, m, rihei, ^sf, F) does the same for a fermion with outgoing fermion flow, i.e. our incoming u and 
our outgoing The left-chiral and right-chiral components now read F(l : 2) = m(1 — 75)72 and 

F(3 : 4) = m(1 + 75)72, and similarly for the spinor v. The last two entries are F(5) = nsf(p(0) -|- ip(3)) and 

F(6) = n,f(p(l) + ip{2)). 

■ J10XXX(Fi, Fo, g, m, T, Jio) computes the (off-shell) current for the vector boson attached to the two external 
fermions Fi and Fg. The coupUng ^'(1 : 2) is a complex array with the interaction of the left-chiral and 
right-chiral fermion in the upper and lower index. For a general Breit-Wigner propagator we need to know the 
mass TO and the width T of the intermediate vector boson. The output array J^o again has six components which 
for the photon with momentum q are 

J,„(M + l) = -^i^j7'^(5(l)^^^ +5(2)^^) M = 0,l,2,3 

J^o(5) = -F,{^) + F,{^) ^ -p.(0) +po(0) +i (-p,(3) -po(3)) 

Jio{Q) = -Fi{6) + Fo{6) ~ -pi{l)+Po{l)+i{-Pi{2)+po{2)) . (3.63) 

The first four entries in J^o correspond to the index /i or the dimensionaUty of the Dirac matrices in this vector 
current. The spinor index is contracted between and Fi. 

As two more arguments Jio includes the four-momentum flowing through the gauge boson propagator. They 
aUow us to reconstruct from the last two entries 

g^ = (ReJ,„(5),ReJ„(6),lmJ,<,(6),lmJ,„(5)) . (3.64) 

• IOVXXX(Fi, Fo,J,g,V) computes the amplitude of a fermion-fermion-vector coupUng using the two external 
fermionic spinors Fj and Fo and an incoming vector current J which in our case comes from JIOXXX. Again, 
the coupling g{l : 2) is a complex array, so we numericaUy compute 

Fll(9il)'-^+9{2)'-V^]Fi. (3.65) 



All spinor and Dirac indices of the three input arguments are contracted in the final result. Momentum 
conservation is not enforced by lOVXXX, so we have to take care of it by hand. 

Given the list above it is easy to follow how MadGraph computes the amplitude for uu 7* fi^fjr. First, it caUs 
wave functions for all external particles and puts them into the array W{1 : 6, 1 : 4). The vectors W{*,1) and 
W{*, 3) correspond to Fi{u) and Fi(;U+), while W{*, 2) and W{*, 4) mean Fo{u) and Fo{ij,~). 
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The first vertex we evaluate is the incoming quark-photon vertex. Given the wave functions Fi = W{*, 1) and 
Fo = W{*, 2) JIOXXX computes the vector current for the massless photon in the s channel. Not much changes if we 
instead choose a massive Z boson, except for the arguments m and F in the JIOXXX call. Its output is the photon 
current Jjo = W{*, 5). 

The last step combines this current with the two outgoing muons coupling to the photon. Since this number gives the 
final amplitude, it should return a complex number, not an array. MadGraph calls lOVXXX with Fi = W{*, 3) and 
Fo = W{*, 4), combined with the photon current J = W{*, 5). The result AMP is copied into JAMP without an 
additional sign which could have come from the relative ordering of extemal fermions in different Feynman diagrams 
contributing to the same process. 

The only remaining sum left to compute before we square JAMP is the color structure, which in our simple case 
means one color structure with a color factor Nc = 3. 

As an added bonus MadGraph produces a file with all Feynman diagrams in which the numbering of the external 
particles corresponds to the second argument of W and the numbering of the Feynman diagrams corresponds to the 
argument of AMP. This helps us identify intermediate results W, each of which is only computed once, even if is 
appears several times in the different Feynman diagrams. 

As mentioned above, to calculate the transition amplitude MadGraph requires all masses and couplings. They are 
transferred through common blocks in the file coupl.inc and computed elsewhere. In general, MadGraph uses unitary 
gauge for all vector bosons, because in the helicity amplitude approach it is easy to accommodate compUcated tensors, 
in exchange for a large number of Feynman diagrams. 

The function UUBJMUPMUM described above is not yet the full story. When we square A4 symbolically we need to 
sum over the spins of the outgoing states to transform a spinor product of the kind uu into the residue or numerator of 
a fermion propagator. To obtain the fuU transition amplitude nimierically we correspondingly sum over all 
heUcity combinations of the extemal fermions, in our case 2^ = 16 combinations. 



SUBROUTINE SUUB_MUPMUM (PI , ANS) 

C 

C FUNCTION GENERATED BY MADGRAPH 

C RETURNS AMPLITUDE SQUARED SUMMED/AVG OVER COLORS 

C AND HELICITIES FOR THE POINT IN PHASE SPACE P ( : 3 , NEXTERNAL ) 

C 

C FOR PROCESS : u u" -> mu+ mu- 
C 

INTEGER NEXTERNAL, NCOMB, 

PARAMETER (NEXTERNAL=4 , NCOMB= 16) 
INTEGER THEL 
PARAMETER (THEL=NC0MB*1) 

REAL*8 PI (0:3, NEXTERNAL ), ANS 

INTEGER NHEL (NEXTERNAL, NCOMB) , NTRY 
REAL* 8 T, UUB_MUPMUM 
INTEGER IHEL, IDEN, IC (NEXTERNAL) 
INTEGER IPROC, JC (NEXTERNAL) 
LOGICAL GOODHEL (NCOMB) 

DATA GOODHEL/THEL* .FALSE. / 
DATA NTRY/0/ 
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DATA (IDEN (IHEL) , IHEL= 1, 1) / 36/ 
NTRY=NTRY+1 



DO IHEL=1, NEXTERNAL 

JC(IHEL) = +1 
ENDDO 



DO IHEL=1,NC0MB 

IF (GOODHEL (IHEL, IPROC) .OR. NTRY . LT . 2) THEN 
T = UUB_MUPMUM (PI, NHEL (1, IHEL) , JC (1) ) 
ANS = ANS + T 

IF (T .GT. ODO .AND. .NOT. GOODHEL ( IHEL, IPROC ) ) THEN 

GOODHEL (IHEL, IPROC) = . TRUE . 
END IF 
ENDIF 
ENDDO 

ANS = ANS/DBLE (IDEN) 
END 



The important part of this subroutine is the list of possible helicity combinations stored in the array 7ihei(l : 4, 1 : 16). 
Adding all different helicity combinations means a loop over the second argument and a call of UUBJMUPMUM with 
the respective helicity combination. Because of the naive helicity combinations many are not allowed the array 
GOODHEL keeps track of valid combinations. After an initialization to all 'false' this array is only switched to 'true' if 
UUBJMUPMUM returns a finite value, otherwise MadGraph does not waste time to compute the matrix element. At the 
very end, a complete spin-color averaging factor is included as IDEN and in our case given by 2 x 2 x N'^ = 36. 

Altogether, MadGraph provides us with the subroutine SUUB_MUPMUM and the function UUB_MUPMUM which 

together compute \A4\^ for each phase space point, i.e. for each external momentum configuration. This helicity 
method might not seem particularly appealing for a simple (2 — > 2) process, but it makes it possible to compute 
processes with four and more particles in the final state and typically up to 10000 Feynman diagrams which we could 
never square symbolically, no matter how many graduate students' live times we throw in. 



3.4 Missing transverse energy 

Some of the most interesting signatures at the LHC involve dark matter particles. From cosmological constraints we 
know that dark matter definitely interacts gravitationally and that it cannot carry electromagnetic or color charges. 
Weak interactions are allowed because of their limited reach. It turns out that a weakly interacting particle with a mass 
around the electroweak scale typically give the observed relic density in the universe. This is called the 
WIMP miracle . It it the reason why in modern TeV-scale model building every model (and it's dog) predict a stable 
WIMP. From supersymmetry we know that this is not hard to achieve: all we need is a Z2 symmetry, i.e. a 
multiplicative quantum number for (a sector of) newly introduced particles. In supersymmetry this is called R parity, 
in little-Higgs models T parity, and in extra dimensional models Kaluza-Klein parity. 



At the LHC we typically produce strongly interacting new particles, provided they exist. If a conserved dark matter 
quantum number exists we will always produce them in pairs. Each of them decays to the weakly interacting sector 
which includes a stable dark matter agent. On the way, the originally produced particles have to radiate quarks or 
gluons to get shed their color charge. If in some kind of cascade decays they also radiate leptons or photons those can 
be very useful to trigger on and to reduce QCD backgrounds, but this depends on the details of the weakly interacting 
new physics sector. The decay steps ideally are two body decays from on-shell particle to on-shell particle, but they do 
not have to be. What we therefore look for is jets in association with pairs of only weakly interacting, hence invisible 
particles in the ATLAS and CMS detectors. 



From Eq.(2.28 1 and the discussion of parton densities we remember that at hadron colliders we do not know the 
kinematics of the initial state. In beam direction we only know its boost statistically, whereas in the transverse plane 
by definition the incoming partons have zero momentum. The way to look for invisible particles therefore is a 
mis-balance of three-momentum in the transverse plane. The actual observable is the missing transverse momentum, 
i.e. the vector sum of the transverse momenta of all invisible particles. We can convert it into a missing transverse 
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Figure 15: Left: missing energy distribution from the early running phase of the DZero experiment at the Tevatron. 
Figure from Beate Heinemann. Right: corrected missing energy distribution in QCD events at ATLAS using only data 
from April/May 2010 at 7 TeV collider energy. Figure from Ref. ll45l . 



energy which in the absence of any information on particle masses is defined as the absolute value of the 
two-dimensional missing momentum vector. LHC events including dark matter are characterized by a high jet 
multiplicity and large missing transverse energy. 

At the end of Section 2.5.3 we focus on the proper simulation of W+jets and Z+jets samples, which are the Standard 
Model backgrounds to such signals. It will turn out that jet merging is needed to reliably predict the missing 
transverse momentum distributions in Standard Model processes. After all our studies in Section|2]we are at least 
theoretically on safe ground. However, this is not the whole story of missing transverse momentum. 



3.4.1 Measuring missing energy 



The left panel of Figure 15 is a historic missing transverse energy distribution from DZero. It nicely illustrates that by 
just measuring missing transverse energy, Tevatron would have discovered supersymmetry based on two beautiful 
peaks around 150 GeV and around 350 GeV. However, this preliminary experimental result has nothing to do with 
physics, it is purely a detector effect. 

The problem of missing energy we can illustrate using a simple number: to identify and measure a lepton we need 
around 500 out of 200000 calorimeter cells in an experiment like ATLAS, while for missing transverse energy we 
need all of them. To cut on a variable like missing transverse momentum we need to understand our detectors really 
well, and for this level of understanding we need time. 



There are several sources of missing energy which we have to understand before we get to search for new physics: 

- First, we have to subtract bad runs. They happen if for a few hours parts of the detector might not have worked 
properly. We can identify such bad runs by looking at Standard Model physics, Uke gauge bosons, and remove 
them from the data sample. 

- Next, there will be coherent noise in the calorimeter. With 200000 cells we know that some of them will 
individually fail or produce noise. Some sources of noise, like leaking voltage or other electronic noise can be 
correlated geometrically. It will lead to beautiful missing momentum signals. In the same spirit, there might 
also be particles crossing our detector, but not coming from the interaction point. They can be cosmic rays or 
errand beam radiation, and they will lead to unbalanced energy deposition as well. Again, the way to get rid of 
such noise is looking at Standard Model candles and removing sets of events where problems occur 

- The third class of fake missing energy is failing calorimeter cells, like continuously hot cells or dead cells, 
which can be removed for good once we know the detector really well. 
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- While not really a detector fake the main source of missing energy at hadron colliders are mis-measured QCD 
jets. If parts of jets point into regions with poor calorimetry, like support structures, their energy will be wrongly 
measured, and the corresponding QCD event will show missing transverse energy. One way to tackle this 
problem is to require a geometric separation of the missing momentum vector and all jets in the event. 
Nevertheless, as a rule of thumb, up to 0(1%) of all hard QCD events still lead to more than 100 GeV of fake 
missing transverse energy. Figure |5] shows that this is not at all a negligible number. 

Once we understand all sources of fake missing momentum we can focus on real missing momentum. This missing 
transverse momentum we compute from the momenta of all tracks seen in the detector. This means that any 
uncertainty on these measurements, like the jet or lepton energy scale will smear the missing momentum. Moreover, 
we know that there is for example dead matter in the detector, so we have to compensate for this. This compensation 
is a global correction to individual events, which means it will generally smear the missing energy distribution. The 
right panel of Figure[T5]shows a very early missing transverse energy distribution of ATLAS after some of the 
corrections described above. 

To simulate a realistic missing transverse momentum distribution at the LHC we have to smear all jet and lepton 
momenta, and in addition apply a Gaussian smearing of the order 



While this sounds like a trivial piece of information 1 cannot count the number of papers where people forget this 
smearing and discover great channels to look for Higgs bosons or new physics. They fall apart when experimentaUsts 
take a careful look. The simple rule is: phenomenological studies are right or wrong based on if they can be 
reproduced by real experimentalists and real detectors or not. 

3.4.2 Missing energy in the Standard Model 

In the Standard Model there exists a particle which only interacts weakly: the neutrino. At colliders we produce them 
in reasonably large numbers in W decays. This means that in W+ jets production we can learn how to reconstruct the 
mass of a leptonically decaying W, i.e. one observed and one missing particle. We construct a transverse mass in 
analogy to an invariant mass, but neglecting the longitudinal momenta of the decay products 



in terms of a transverse energy — Pr ~^ Since the transverse mass is always smaller than the actual W mass 
and reaches this limit for realistic phase space regions we can extract mw from the upper edge in the nix^w 
distribution. Obviously, we can define the transverse mass in many different reference frames. However, its value is 
invariant under — or better independent of — longitudinal boosts. Moreover, given that we construct it as the 
transverse projection of an invariant mass it is also invariant under transverse boosts. By construction we cannot 
analyze the transverse mass event by event, so this W mass measurement only uses the fraction of events which 
populate the upper end of the transverse mass distribution. 

Alternatively, from single top production and the production of mixed leptonically and hadronically decaying top pairs 
we know another method to conditionally reconstruct masses and momenta involving one invisible particle: from a 
leptonically decaying top quark we only miss the longitudinal momentum of the neutrino. On the other hand, we 
know at least for the signal events that the neutrino and the lepton come from an on-shell W boson, so we can use this 
on-shell condition to reconstruct the longitudinal neutrino momentum under the assumption that the neutrino have 
zero mass. Recently, we have seen that sufficiently boosted top quarks with leptonic decays can be fully reconstructed 
even without using the measured missing energy vector. Instead, we rely on the W and t on-shell conditions and on an 
assumption about the neutrino momentum in relation to the bottom-lepton decay plane. 




(3.66) 




.miss 

+ ™mi,ss + 2 {Et/Etm^^s - PT,e ■ PTMss 



(3.67) 
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From Higgs searches we know how to extend the transverse mass to two leptonic W decays, i.e. two neutrinos in the 
final state. The definition of this transverse mass 

= ruu + m^i,, + 2 {EtmEtms^ - Pt.u ■ Ptms^) (3.68) 

is not unique because it is not clear how to define m^i^^, which also enters the definition of i?T,miss- From Monte Carlo 
studies it seems that identifying m^iss = mu, which is correct at threshold, is most strongly peaked. On the other 
hand, setting nimiss = to define a proper bounded-from-above transverse mass variable seems to improve the Higgs 
mass extraction. 



For an unspecified number of visible and invisible particles in the final state there also exist global observable we can 
rely on. The visible mass it is based on the assumption that we are looking for the decay of two heavy new states 
where the parton densities will ensure that these two particles are non-relativistic. We can then approximate the 
partonic energy mi + m2 by some kind of visible energy. If the heavy states are produced with little energy, 

boost invariance is not required for this construction. Without taking into account missing energy and adding leptons £ 
and jets j the visible mass looks like 



'visible 



E 

e,3 



(3.69) 



Similarly, the Tevatron experiments have for a long time used an effective transverse mass scale which is usually 
evaluated for jets only, but can trivially be extended to leptons: 



Pt 



(3.70) 



where the last step assumes that all final state particles are massless. In an alternative definition of Hr we sum over a 
number of jets plus the missing energy and skip the hardest jet in this sum. 

When combining such a measure with missing transverse momentum the question arises: do we want to pair up the 
missing transverse momentum with the visible transverse momenta or with the complete visible momenta? For 
example, we can use the scalar sum of all transverse momenta in the event, now including the missing transverse 
momentum 

meff = ^ ^ PT ■ (3.71) 



E 

^, J, miss 



^, J, miss 



This effective mass is known to trace the mass of the heavy new particles decaying for example to jets and missing 
energy. This interpretation relies on the non-relativistic nature of the production process and our confidence that all 
jets included are really decay jets. 



3.4.3 Missing energy and new physics 

The methods described in the last section are well studied for different Standard Model processes and can be applied 
in new physics searches for various lengths of decay chains. However, there is need for one significant modification, 
namely to account for a finite unknown mass of the carrier of the missing energy. This is a problem of relativistic 
kinematics and at leading order does not require any knowledge of QCD or new physics models. 



The chain of three successive three-body decays shown in Figure 16 is the typical left handed squark cascade decay in 
supersymmetry. The same topology we can interpret in extra-dimensional models (universal extra dimensions or 
UED) as the decay of a Kaluza-Klein quark excitation 

qL^ Xlq^i^t^q-^ Xl^^f-'q Q^l'^ Z^^\ l'^^^^ q r q . (3.72) 



In both cases the last particle in the chain, the lightest neutralino or the Kaluza-Klein photon excitation pass the 
detectors unobserved. The branching ratio for such decays might not be particularly large; for example in the 
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Figure 16; Feynman diagram for the long decay chain shown in Eq.( |3.72| l. 

supersymmetric parameter point SPSla with a mass spectrum we will discuss later in Figure |20]the long squark decay 
ranges around 4%. On the other hand, strongly interacting new particles should be generously produced at the LHC, 
so we usually assume that there will be enough events to study. The question is how we can then extract the four 
masses of the new particles appearing in this decay from the three observed external momenta. 



The proposals to solve this problem can be broadly classified into four classes. While all of them should in principle 
work and would then differ mostly by statistics, only for the first strategy we know how QCD and detector smearing 
affect them. 

1. Endpoint methods extract masses from lower (threshold) and upper (edge) kinematic endpoints of invariant 
mass distributions of visible decay products. This method is best suited for long decay chains, where the 
number of independent endpoint measurements in one leg at least matches the number of unknown masses in 
the cascade. An implicit assumption of these endpoint techniques is that the form of the matrix element 
populates the phase space close to the endpoint well. Otherwise, the endpoint will be soft and difficult to 
identify on top of the continuum background. 



The squark decay Eq.( 3.72 1 has a particular kinematic feature: the invariant mass distributions of the two 
leptons mee. Looked at from the rest frame of the intermediate slepton it is a current-current interaction, similar 
to the Drell-Yan process computed in Eq.ljZTTJ. Because in the s channel there now appears a scalar particle 
there cannot be any angular correlations between the two currents, which means the meg distribution will have a 
triangular shape. Its upper limit, called the dilepton edge, we can compute: in the rest frame of the scalar lepton 
the three-momenta of the incoming and outgoing pair of particles have the absolute values 
\p\ = |"^|o — "^||/ (2m^). The lepton mass we set to zero. The invariant mass of the two lepton reaches its 
maximum if the two leptons are back-to-back, i.e. the scattering angle is cos 9 = —1 

^2 



w« = {Pi+ + Pe- ] 

= 2 {E,^E,- - |p,+ ||p,-|cos0) 
<2 {E,^E,-+\p,+ \\p,-\) 

= 4^1 ^ using El^^f^i. (3.73) 

2m^ 2to£ 

This kinematic statement is independent of the shape of the mu distribution. For the particle assignments 
shown in Eq. p.72[ ) the kinematic endpoint are given by 

(m|o-mf)(mf-m|o) Kd) - ^,^(i))Ka) " "^^(d) ^. 

< TO„ < -2 < TO« < -2 . (3.74) 

"''I ' 1(1) 

A problem in realistic applications of endpoint methods is combinatorics. We need to either clearly separate the 
decays of two heavy new states, or we need to combine a short decay chain on one side with a long chain on the 
other side. In supersymmetry this is naturally the case for associated squark and gluino production. A right 
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Figure 17: Example for the mass relation method using three successive two-body decays on both sides of the events 
(left). After detector smearing we can reconstruct the masses for the supersymmetric parameter point SPSla with the 
squark decay chain shown in Eq.( 3.72 1 (right). Figure from Ref. 



handed squark often decays directly to the lightest neutralino which is the dark matter candidate in the model. 
The gluino has to radiate two quark jets to reach the weakly interacting sector of the model and can then further 
decay in many different ways. In other models this feature is less generic. The impressive potential of endpoint 
methods in the case of supersymmetry we will illustrate later in this section. 

When looking at long cascade decays for example with two leptons we usually cannot tell which of the two 
leptons is radiated first. Therefore, endpoint techniques will always be plagued with combinatorial background 
from the mapping of the particle momenta on the decay topology. The same applies to QCD jet radiation vs 
decay jets. In this situation it is useful to consider the correlation of different invariant masses and their 
endpoints. The endpoint method can be extended to use invariant mass distributions from both sides of the event 
(hidden threshold techniques), and correlations between the distributions from each leg (wedgebox techniques). 



2. Mass relation methods generalize the single-top example in Section 3.4.2 and completely reconstruct the 
kinematics event by event. For each event this construction provides a set of kinematic constraints. While for 
one event the number of parameters can be larger than the number of measurements, adding signal events 
increases the number of measurements while keeping the number of unknowns constant. Eventually, the system 
of equations will solve, provided all events are really signal events. Implicitly, we always assume that all 
decaying particles are on-shell, but off-shell effects we do not expect to have a huge effect on the results at the 
LHC. 

In FigureflTlwe show the general topology of a three-step cascade decay on each side of the event, like we 



expect it for a pair of left handed squarks following Eq.( 3.72 1. To extract the masses of the new particles we 
need to solve the system of equations 

(Pl +P2 +P3+P4f^ m% 
iP2 +P3 +P4)^= my 

{P3+P4f= mj. 



pI- 



I'M 



(3.75) 



for each side of the event. For each event there are eight unknown masses and six unknown missing momentum 
components of which we measure two combinations as the missing transverse momentum. All of these 12 
unknowns we can determine if we add a sufficiently large number of events. 



One strategy to solve this problem is to assume eight test masses m — [ra 



), use the three 



first equations in Eq.( 3.75 1 for each event plus the two missing transverse momentum components to determine 
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Figure 18: Simulations for different rriTx, for the decay xt ^ Xi^ or 
the two-body decay. Figure from Ref. Ii47i . 



Xie'^v. The blue mT2 line applies to 



both missing four-momenta and test the consistency of this solution using the last line of Eq. p.75[ ) for each of 
the two legs. In this consistency test we combine the information from several events. 

The first three lines in Eq.( 3.75| l we can conveniently solve for the missing momentum 



2(PlJ34) = 


Si 


= my 


2(P2P4) = 


S2 


= m\ 


2(P3P4) = 


S3 





2(piP2) 
- 2(P2P3) 



2(P1P3) 



(3.76) 



for simplicity assuming massless Standard Model decay products. Similarly, we define the measured 
combinations 55.6,7 from the opposite chain. In addition, we measure the missing transverse momentum as 
VTA + VTfi = (s4, ss)- Using the first equal sign in Eq.(3.76 1 we we can collect the two missing four-momenta 

(P4 



-VT 
into pmiss = 



E4,ps, Es) and express this vector in terms of measured quantities and masses in the cascade 
A ■ Pmiss = s. The matrix A includes only components of measured momenta and is almost block diagonal, so it 
can be inverted. Following the second equal sign in Eq.(3.76 1 we can then write s = B ■ m + c, where the matrix 
B only contains non-zero entries ±1 and the vector c consists of measured quantities. Together, this gives us 

p^i,, ^ A-h A-^ B m + A-^ c . (3.77) 

The result for all masses in the decay chain using 25 events per set and including all combinatorics we show in 



Figure 17 The mass relation method has also been successfully applied to single legs as well as in combination 



with kinematic endpoints. 

MT2 methods are based on a global variable mx2- It generalizes the transverse mass known from W decays to 
the case of two massive invisible particles, one from each leg of the event. The observed missing energy in the 
event we can divide into two scalar fractions pr.miss — Qi + q2- Given the two fractions qj we can construct a 
transverse mass for each side of the event, assuming we know the invisible particle's mass rnxjigj; frimiss); the 
second argument is an external assumption, i.e. rrimiss is an assumed value for m^iss. 

Inspired by the usual transverse mass we are interested in a mass variable with a well-defined upper edge, so we 
need to construct some kind of minimum of rriTj as a function of the splitting of pT,miss- Naively, this minimum 
will simply be the zero transverse momentum limit of mx on one leg, which is not very interesting. On the 
other hand, in this case the transverse mass from the other leg reaches a maximum, so we can instead define 



mm 

PT, miss = 9l +92 



max mT^jiqfMmiss) 



TOT2("lmiss) 

as a function of the unknown missing particle mass. There are a two properties we know by construction 

"llight + TOmiss < nT-T2("^miss) 

< mT2("lniiss) < TOheavy 



(3.78) 



(3.79) 
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m(ll) (GeV) 



Figure 19: Invariant mass of two leptons after selection cuts for the SPS la parameter point: SUSY signal Opposite Sign 
Same Flavor (OS-SF): full line; SUSY signal Opposite Sign Opposite Flavor (OS-OF): dotted line; Standard Model 
background: grey. Figure from Giacomo Polesello (ATLAS). 



The first line means that each of the rriT.j lie between the sum of the two decay products' masses and the mass 
of the decaying paiticle, so for massless Standard Model decay products there will be a global 171x2 threshold at 
the missing particle's mass. 

Moreover, for the correct value of m^nKs the mT2 distribution has a sharp edge at the mass of the parent particle. 
In favorable cases tot2 may allow the measurement of both the parent particle and the LSP based on a 



single-step decay chain. These two aspects for the correct value jfimiss — ^Timiss we can see in Figure 18 the 
lower threshold is indeed given by 771^2 — n^xl ~ while the upper edge of mT2 ~ rn^o coincides with the 
dashed line for m + — m^o. 

An interesting aspect of mT2 is that it is boost invariant if and only if mmiss = "^miss- For a wrong assignment of 
TOmiss it has nothing to do with the actual kinematics and hence with any kind of invariant, and house numbers 
are not boost invariant. This aspect we can exploit by scanning over mmiss and looking for so-called kinks, i.e. 
points where different events kinematics all return the same value for mx2- 

Similar to the more global Weff variable we can generalize to the case where we do not have a clear 
assignment of the two decay chains involved. This modification Mrcen again has an upper edge, which 
unfortunately is not as sharp as the one in rnT2- Similarly, the procedure can be generalized to any one-step 
decay, for example a three-body decay with either one or two missing particles on each side of the event. Such 



Mtx distributions are useful as long as they have a sharp enough edge, as illustrated in Figure 18 



4. Extreme kinematics can also give us a handle on mass reconstruction from an incomplete set of observables. 
One such phase space region are points close kinematic endpoints where particles are produced at rest. Other 
examples are the approximate collinear Higgs mass reconstruction in a decay to boosted tau pairs described in 



Section 1.5.3 or the boosted leptonic top decays mentioned before. 



The way mass measurements can lead to proper model reconstruction we sketch for one example. The classic example 
for the endpoint method is the long supersymmetric left handed squark decay chain shown in Eq. p.72| i and in 
Figure 16 When we use such kinematic endpoints or other methods to extract mass parameters it is crucial to start 
from a signal-rich sample to avoid combinatorics and washed out endpoints vanishing in a fluctuating or even 
sculptured background. For jets, leptons and missing energy a major background will be top pairs. The key 
observation is that in long cascade decays the leptons are flavor-locked, which means the combination 
e+e^ + /i^/i^ — fi^ — e+/i^ becomes roughly twice 11^ fJ,^ for the signal, while it cancels for top pairs. Using 
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Sparticle masses and mass differences [GeV] 



Figure 20: Masses extracted from the gluino-sbottom decay chain, including estimated errors. The faint blue lines 
indicate wrong solutions when inverting the endpoint-mass relations. The supersymmetric mass spectrum is given by 
the SPS la parameter point. Figure from Ref. 1481 . 



such a combination for the endpoint analysis means the top background is subtracted purely from data, as illustrated in 



Figure 19 



The long squark decay in SPS la-like parameter points with squark masses in the 500 to 600 GeV range has an 
important advantage: for a large mass hierarchy we should be able to isolate the one decay jet just based on its energy. 



In complete analogy to the dilepton edge shown in Eq.( 3.74 1, but with somewhat reduced elegance we can measure 
the threshold and edge of the £^£^q distribution and the edges of the two £^q combinations. Then, we solve the 
system for the intermediate masses without any model assumption, which allows us to even measure the dark matter 
mass to 0(10%). The limiting factors will likely be our chances to observe enough endpoints in addition to to™" and 
the jet energy scale uncertainty. An interesting question is how well we will do with tau leptons, where the edge is 
softened by neutrinos from tau decays. 

Provided the gluino or heavy gluon is heavier than the squarks or heavy quarks we can measure its mass by extending 
the squark chain by one step: g — > qq. This measurement is hard if one of the two jets from the gluino decay is not 
very hard, because its information will be buried by the combinatorial error due to QCD jet radiation. The way around 
is to ask for two bottom jets from the strongly interacting decay: g bb* or G'^^-' — t- bB^^K The summary of all 



measurements in Figure 20 shows that for example the gluino mass we can extract at the per-cent level, a point at 
which we might have to start thinking about off-shell propagators and at some point even define what exactly we mean 
by 'masses as appearing in cascade decays'. 

A generic feature or all methods relying on decay kinematics is that it is easier to constrain the differences of squared 
masses than the absolute mass scale. This is also visible in Figure [20l It is due to the form of the endpoint formulas 
which involve the difference of mass squares mf — m| = (toi + TO2)(toi — TO2). This combination is much more 
sensitive to (toi — TO2) than it is to (mi + 7712). Experimentally, correlated jet and lepton energy scale uncertainties 
do not make life easier either. Nevertheless, the common lore that kinematics only constrain mass differences is 
obviously not true for two body decays. 

Alternatively, we can use the same gluino decay to first reconstruct the intermediate neutralino or Kaluza-Klein Z 
momentum for lepton pairs residing near the mu edge. In that case the invisible heavy state is produced 
approximately at rest, and the momenta are correlated as 

P^o = 1 - ^ pu Pz(i) = 1 - m (3.80) 



3.4 Missing transverse energy 
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Figure 21: Asymmetry in mjg/m^l^ for supersymmetry (dashed) and universal extra dimensions (solid). The spectrum 
is assumed to be hierarchical, typical for supersymmetric theories. Figure taken from Ref. 



If both neutralino masses (or the Kaluza-Klein photon and Z masses) are known we can extract the sbottom 
(Kaluza-Klein bottom) and gluino (Kaluza-Klein gluon) masses by adding the measured bottom momenta to this 



neutralino (Kaluza-Klein photon) momentum. Again, for the mass spectrum shown in Figure 20 we can measure the 
gluino mass to few per-cent, depending on the systematic errors. 

For a complete analysis, kinematic endpoints can be supplemented by any other method to measure new physics 
masses. For short decay chains mT2 is best suited to measure the masses of particles decaying directly to the dark 
matter agent. In supersymmetry, this happens for right handed sleptons or right handed squarks. The issue with short 
decay chains is that they often require on some kind of jet veto, which following Sections 2.5.3 and 1.5.2 is 
problematic for low-pT jets. 

Keeping in mind that endpoint analyses only use a small fraction of the events, namely those with extreme kinematics, 
an obvious way to improve their precision is to include the complete shape of the invariant mass distributions. 
However, this strategy bears a serious danger Invariant masses are just an invariant way of writing angular 
correlations between outgoing particles. Those depend on the spin and quantum numbers of all particles involved. For 
example, in the case of the m« endpoint the triangular shape implies the absence of angular correlations, because the 
intermediate particle is a scalar. This means that we should be careful when extracting information for example from 
kinematic endpoints we do not observe. Depending on the quantum numbers and mixing angles in the new physics 
scenario, kinematic endpoints can for example be softened, so they vanish in the background noise. 

This argument we can turn around. Measuring discrete quantum numbers, like the spin of new particles, is hard in the 
absence of fully reconstructed events. The usual threshold behavior is not observable at hadron colliders, in particular 
when the final state includes missing transverse energy. Instead, we rely on angular correlation in decays. For the 



squark decay chain given in Eq.( 3.72 1 there exists a promising method to simultaneously determine the spin of all new 
particles in the chain: 

1. Instead of trying to measure spins in a general parameterization we start from the observation that cascade 
decays radiate particles with known spins. This is most obvious for long gluino decays where we know that the 
radiated bottom quarks as well as muons are fermions. The spins inside the decay chain can only alternate 
between fermions and bosons. Supersymmetry switches this fermion/boson nature compared to the 
corresponding Standard Model particle, so we can contrast it with another hypothesis where the spins in the 
decay chain follow the Standard Model assignments. Such a model are Universal Extra Dimensions, where each 
Standard Model particle acquires a Kaluza-Klein partner from the propagation in the bulk of the additional 
dimensions. 

2. Thresholds and edges of all invariant masses of the radiated fermions are completely determined by the masses 
inside the decays chain. Kinematic endpoints cannot distinguish between supersymmetry and universal extra 
dimensions. In contrast, the shape of the distribution between the endpoints is nothing but an angular correlation 
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in some reference frame. For example, the rriji distribution in principle allows us to analyze spin correlations in 
squark decays in a Lorentz invariant manner The only problem is the link between £^ and their ordering in 
decay chain. 

a proton-proton collider like the LHC produces considerably more squarks than antisquarks in the 
squark-gluino associated channel. For the SPSla spectrum at 14 TeV collider energy their ratio ranges around 
2: 1. A decaying squark radiates a quark while an antisquark radiates an antiquark, which means that we can 
define a non-zero production-side asymmetry between mji+ and niji- . Such an asymmetry we show in 



Figure 21 for the SUSY and for the UED hypotheses. Provided the masses in the decay chain are not too 



degenerate we can indeed distinguish the two hypotheses. 

This basic idea has since been applied to many similar situations, like decays including gauge bosons, three-body 
decays, gluino decays with decay-side asymmetries, cascades including charginos, weak-boson-fusion signatures, etc. 
They show that the LHC can do much more than just discover some kind of particle beyond the Standard Model. It 
actually allows us to study underlying models and symmetries. 



3.5 Uncertainties 

As we argue in the very beginning of the lecture, LHC physics always means extracting signals from often large 
backgrounds. This means, a correct error estimate is crucial. For LHC calculations we are usually confronted with 
three types of errors. 

The first and easiest one are the statistical errors . For small numbers of events these experimental errors are described 
by Poisson statistics, and for large numbers they converge to the Gaussian limit. In that limit the number of standard 
deviations in terms of the number of signal and background events is = A'sig/V-^back- The two event numbers are 
proportional to the integrated luminosity C which means that the statistical significance in the Gaussian limit increases 
with In high energy physics five standard deviations above a known background we call a discovery, three sigma 
is often referred to as an evidence. The Poisson region is the only complication we encounter for statistical errors. It 
means that for small number of signal and background events we need more luminosity than the Gaussian limit 
suggests. 

The second set of errors are systematic errors, like the calibration of the jet and lepton energy scales, the 
measurements of the luminosity, or the efficiencies to for instance identify a muon as a muon. Some readers might 
remember a bunch of theorists mistaking a forward pion for an electron — that happened right around my TASI in 
1997 and people not only discovered supersymmetry but also identified its breaking mechanism. Of course, our 
experimentalist CDF lecturer told us that the whole thing was a problem of identifying a particle in the detector with 
an efficiency which does not have to zero or one. 

Naively, we would not assume that systematic errors follow a Gaussian distribution, but experimentally we determine 
efficiencies and scaling factors largely from well understood background processes. Such counting experiments in 
background channels like Z — ^ leptons do behave Gaussian. The only caveat is the shape of far-away tails, which 
often turn out to be higher than the exponentially suppressed Gaussian tails. 

Systematic errors which do not follow a Gaussian distribution we quantify in terms of = ^sig/^back which means 
they do not improve with increasing luminosity. Again, five standard deviations are required to claim a discovery, and 
once we are systematics dominated waiting for more data does not help. 

The third source of errors are theoretical errors . They are hardest to model because they are often dominated by 
higher-order QCD effects in fixed order perturbation theory. From Section|2]we know that higher order corrections for 
example to LHC production rates do not follow a naive power counting in Og but are enhanced by large logarithms. If 
we can compute or reliably estimate some higher order terms in the perturbative QCD series we call this a prediction. 
In other words, once we consider a statement about perturbative QCD a statement about its uncertainty we are 
probably only giving a wild guess. 

To model theoretical uncertainties it is crucial to realize that higher order effects are not any more likely to give a K 
factor of LO than 0.9 or LL In other words, likelihood distributions accounting for theoretical errors do not have a 
peak and are definitely not Gaussian. There is a good reason to choose the Gaussian short cut, which is that folding 
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three Gaussian shapes for statistical, systematic and theoretical errors gives us a Gaussian distribution altogether, 
which makes things numerically much easier But this approach assumes that we know much more about QCD than 
we actually do which means it is not conservative at all. 

On the other hand, we also know that theoretical errors cannot be arbitrarily large. Unless there is a very good reason, 
a K factor for a total LHC cross section should not be larger than something like two or three. If that happens we need 
to conclude that perturbative QCD breaks down, and the proper description of error bars is our smallest problem. In 
other words, the centrally flat theory probability distribution for an LHC observable has to go to zero for very large 
deviations from the currently best value. 

In the Bayesian picture we define individual likelihood distributions for the different errors and integrate over the 
corresponding uncertainties. This is where the convolution of several Gaussians comes from. The same thing we can 
do for a Gaussian experimental error and a flat theory errors. The integral returns a two-sided error distribution which 
still has a peak. 

An intuitively better solution which gives a centrally flat likelihood distributions for the combined errors is the 
Rfit scheme , used for example by CKMfitter or SFitter It is a frequentist of profile likelihood construction 
marginalizing the unknown errors. Such a marginalization of an unwanted direction in the parameter space of our 
likelihood is a mathematical problem: to keep the mathematical properties of a likelihood as a probability measure we 
prefer to integrate over unwanted directions. This is what the Bayesian pictures does. The downside is that any 
integration needs a measure, and a measure in model space or error space is not defined by physics. We therefore 
assume such a prior and have to test our final result by varying this prior in what we consider a reasonable range. 
A profile likelihood instead projects the best fitting point of the unwanted direction onto the new parameter space; for 
each binned parameter point in the (n — 1) -dimensional space we explore the nth direction which is to be removed 

„_i, x„). Along this direction we pick the best value and it with the lower-dimensional parameter point 
^{xi....^n-i) = 'C™''^"-'(a;i_. ..,„_!, Xn)- Such a projection avoids defining a measure but it does not maintain for 
example the normalization of the likelihood distribution. 

The Rfit scheme projects away a flat theory error as well as a Gaussian experimental error. For the center of the 
distribution this means we need to cut open the experimental Gaussian distribution and insert a flat theory piece. For 
the log-likelihood = — 2 log C given a set of measurements d and in the presence of a general correlation matrix C 
this gives us 

fO |d^_d-|<^f-) 

J J I (theo) 

'^'^-'^^ + ''^ - d, < -a**™) 

^(exp) ^ » ^ (3 81) 

(theo) 

(theo) 



Xd,: 



^(exp) 



> cr- 



This distribution implies that for very large deviations there will always be tails from the experimental errors, so we 
can neglect the impact of the theoretical errors on this range. In the center the distribution is flat, reflecting our 
ignorance of the theory prediction. The impact of the size of the flat box we need to test. 

3.6 Further reading 

Again, there exist several good review articles with more in-depth discussions of different aspects touched in this 
Section: 

- as mentioned in Section|2]two very useful reviews of jet physics are available by Steve Ellis and 
coUaborators ||32l and by Gavin Salam ||33l . 

- for the general phenomenology of the heaviest Standard Model particles, the top quark, have a look at Sally 
Dawson's TASI lectures |50|. 

- if you use MadGraph/HELAS to compute helicity amplitudes there is the original documentation which 
describes every routine Il5l1 . 
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a lot of experimental knowledge on new physics searches well described and theoretically sound you can find in 
the CMS technical design report. Some key analyses are described in detail while most of the documentation 
focuses on the physics expectations ||52l . 

more on the magical variable 7717-2 you can find in an article by Alan Barr, Chris Lester and Phil Stephens ll47l . 
Chris Lester's thesis f53\ is a good point to start with. Recently, Alan Barr and Chris Lester published a broad 
review on techniques to measure masses in models with dark matter particles ||54l. 

as mentioned in the introduction, there is our more advanced review on new physics at the LHC which includes 
an extensive chapter on LHC signatures HI . 

a lot of insight into new physics searches at the LHC and at a linear collider you can find in a huge review 
article collected by Georg Weiglein 1551 . 

the pivotal work on determining spins in cascade decays is Jennie Smillie's PhD thesis ||49]| . On the same topic 
there exists a nicely written review by Liantao Wang and Itay Yavin 11561 . 

many useful pieces of information on mass extraction, errors, and the statistical treatment of new-physics 
parameter spaces you can find in the big SFitter publication 1,57] . The SFitter analysis of the Higgs sector ifTOl is 
very similar in structure, but different in the physics application. 
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